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1. INTRODUCTION 


For several years, the Earth Observations Division (EOD) of the 
National Aeronautics and Space Administration (NASA) , Lyndon B. 
Johnson Space Center (JSC) , has supported research for the 
development of techniques to be used in processing remotely 
sensed imagery data obtained from the multispectral scanner (MSS) 
placed aboard various aircraft and satellites. One of the 
earliest operational computer systems to use pattern recognition 
techniques in the analysis of these data was developed at Purdue 
University's Laboratory for Applications of Remote Sensing 
(LARS) . The earliest version of the LARS system (LARSYS) was 
converted in 1970 to a batch program for execution on the 
Univac 1108 EXEC 2 system at NASA/JSC (ref. 1). 

The computer system described in this document originated from 
this early version of the LARSYS. However, since 1970, per- 
sonnel of the EOD, Lockheed Electronics Company, Inc. (LEC) , 
and other EOD support contractors have made many modifications 
and improvements to the Univac 1108 version of the LARSYS; thus, 
new techniques have been developed and programmed to perform 
additional functions in the evaluation of the data. 

Although the basic structure of the system remains the same, 
a large portion of it has been reprogrammed. Modifications to 
existing techniques and the addition of new techniques have 
expanded the capabilities of the system. The current version 
is called the EOD-LARSYS . Recently, it has been converted from 
Univac 1108 Fortran to IBM 370/148 Fortran. The EOD LARSYS 
system is now operational on the IBM 370/148 computer located 
at LARS Purdue University, West Lafayette, Indiana. 

It may be accessed both on and off the JSC site by remote 
terminal. 


The purposes of this document are to define the capabilities 
and limitations of the system and to provide the user with 
the information needed to execute the program and obtain the 
output desired. It is assumed throughout the document that 
the user is familiar with the terminology and the pattern 
recognition techniques involved. No attempt is made to assist 
the user in the analysis of data output by the system. 

This EOD-LARSYS User Guide for the IBM 370/148 is being 
issued in four volumes: 

Volume I — System Overview, August 1978 (JSC 13821; 

LEC 12563) 

Volume II — User's Reference Manual, December 1978 

(JSC 13821; LEC 12564) (this document) 

Volume III — "As-Built" Design Specification 
(to be issued) 

Volume IV — Program Listings (to be issued) 



2. GENERAL SYSTEM DESCRIPTION 


The EOD-LARSYS is a batch processing program operational on 
the IBM 370/148 system at Purdue University. The system is 
composed of a system monitor and a set of processors, each of 
which performs a specific function in the analysis of MSS 
imagery data. Linkage between processors is accomplished by 
the use of files in the computer; by files on disk or tape; or, 
less commonly, by card decks. The execution of a particular 
batch job may begin or end with any processor, provided the 
appropriate files are furnished. 

The data are preprocessed by the Earth Resources Interactive 
Processing System (ERIPS) . (ERIPS merges tapes received from 
Goddard Space Flight Center and creates a multi-temporal/multi- 
pass tape.) 

Two pattern recognition classification schemes are provided by 
the system. One, the supervised classification algorithm known 
as the maximum likelihood classifier, is embodied in the CLASSIFY 
processor (ref. 2) . The other, an unsupervised classification 
or clustering algorithm, is embodied in the Iterative Self- 
Organizing Clustering System (ISOCLS) processor (ref. 3) . 

ISOCLS, along with other processors, may be used to "train" 
the maximum likelihood classifier or to display the results 
of classification. 

Having obtained an MSS image data tape (DATAPE) in one of the 
allowable formats (see section 3.2), the data analyst must 
train the classifier. The maximum likelihood classification 
algorithm is based on the assumption that the samples within 
a given class are distributed according to a multivariate 
normal probability density function. Such distribution is 
specified completely in terms of a mean vector and a covariance 


matrix, which must be computed from known samples of the class 
being represented. This implies that the data analyst must 
have some prior knowledge (i.e., ground-truth information) of 
specific areas within the MSS image. Using this ground truth, 
the analyst must identify training samples for computation of 
statistics. The histogram (HIST) and gray map (GRAYMAP) proc- 
essors may be used to aid the analyst. 

The HIST processor provides a histogram of data values from the 
MSS image for use by the GRAYMAP processor. HIST may also be 
used independently to provide the analyst with information on 
the distribution of data values within specific user-defined 
blocks (or fields) of the image. The mean, standard deviation, 
and range of data within each user-defined field are standard 
outputs from the HIST processor for each requested channel. 
Histogram plots may be obtained optionally. With the histogram 
information, a file (HISFIL) is written automatically for the 
GRAYMAP proce s sor . 

The GRAYMAP processor provides the analyst with a pictorial 
gray-scale map of any channel of the MSS image for use in 
obtaining training field coordinates. The map is labeled by 
sample and scan-line numbers. From this map, the analyst may 
locate the fields within the image for which he has ground- 
truth data. Having identified the fields, coordinates (sample 
and scan-line numbers at each vertex) must be noted for defining 
the fields to the statistics (STAT) or ISOCLS processor. 

Alternatively, the analyst may proceed using Procedure 1 
methodology. Using an ERIPS tape as input to the DOTDATA 
processor, the analyst writes a dot data file. A dot data file 
contains both type 1 and type 2 dots. Type 1 dots are used 
both as starting vectors for the clustering processor (ISOCLS) 
and as labeling vectors for the labeling processor (LABEL) . 


Type 2 dots are used as a bias correction factor in computing 
the classification results output by the DISPLAY processor. 

The coordinates for training fields may be input to either 
the STAT or ISOCLS processor for computation of statistics for 
the classifier. The dot data file is input to ISOCLS to compute 
statistics. Both processors save the statistics and training 
field or dot information on a file (SAVTAP, section 4.1) for 
use in other processors. 

In using the STAT processor, the user must group training fields 
into statistically similar subclasses. Subclasses may be grouped 
further into classes. For example, three statistically similar 
subclasses of spring wheat, winter wheat, and harvested wheat 
may be grouped into one wheat class. Statistics for each sub- 
class are maintained on the SAVTAP file, along with the class 
grouping. Class groupings are maintained simply for convenience 
in defining categories in the CLASSIFY processor and for perform- 
ance reports by the DISPLAY processor. The analyst may obtain 
the following output for each training field and/or subclass. 

• Mean vector 

• Covariance matrix 

• Correlation coefficient 

• Histogram plots 
a » Spectral plots 

In using the ISOCLS processor with training classes, the user 
must group the training fields into classes. The clustering 
process breaks the class data into statistically similar sub- 
classes (clusters) . Subclasses are given names by taking the 
first two characters of the class name and two digits indicating 




the number of the subclass within the class. Again, the statis- 
tics for each subclass are saved on the SAVTAP file for use in 
other processors. ISOCLS is an iterative self-organizing clus- 
tering procedure which uses the measure of absolute (LI) dis- 
tance from a picture element (pixel) to the cluster center to 
determine the similarity of pixels. At each iteration the user 
may obtain a cluster summary and map. Optionally, a cluster 
image data tape (MAPUNT) may be output in either LARSYS III or 
Universal format (appendixes B and C, respectively) . 

Under Procedure 1, designated other/designated unidentifiable 
(DO/DU) fields are delineated by card input. ISOCLS clusters 
the LACIE segment using the starting vectors from the dot data 
file to initialize the clustering process. Sun angle correction 
is provided. An unconditional cluster map and a set of 
"unlabeled " statistics are output. 

The "unlabeled" statistics, cluster map, and dot data file are 
input to the LABEL processor. Using one of two procedures, 
k-nearest-neighbor or all-of-a-kind, the statistics are 
"labeled." A conditional or mixed cluster map may be output 
to be displayed later on the Passive Microwave Imaging System/ 
Data Analysis Station (PMIS/DAS) or the Interactive Multispectral 
Image Analysis System, model 100 (Image 100) . 

By the use of the transform statistics (TRSTAT) processor, the 
statistics on the SAVTAP file output by the processor STAT or 
ISOCLS may be transformed according to 


where 

y ' = transformed means 

A Fa matrix 

c 


U ' = Ay + b 
K' = AKA T 


2s/ 


( 2 - 1 ) 


U = means from SAVTAP file 


b = a vector 

K' = transformed covariance matrix 

K = covariance matrix from SAVTAP file 
T 

A = transpose of matrix A 

The transformed statistics are output as a new file on SAVTAP. 

Before proceeding to classification of the MSS image, it may be 
desirable to reduce the dimensionality of the data vectors by 
selecting a smaller set of channels or a linear combination of 
the channels which maximizes some class separability measure. 

The SELECT processor (ref. 4) provides this capability. In 
order to compute the value of the separability measures, the 
statistics calculated using the STAT or ISOCLS processor must 
be made available to the SELECT processor by card deck, tape, 
or disk file, usually the last. The SELECT processor allows 
the analyst to work with subsets of the statistics on the file, 
if he desires. Subsets of the statistics are indicated by the 
CHANNELS and SUBCLASS control cards, which are defined further 
in section 10.4.3. 

In addition, the statistics for two or more subclasses may be 
grouped together and considered as one subclass. Grouping the 
statistics for two or more subclasses is equivalent to going 
back through the STAT processor and combining all training fields 
for those subclasses being grouped into one subclass. The group- 
ing option is exercised via the GROUP control card defined in 
table 10.1. The subsets and groupings of the statistics pro- 
vided to the SELECT processor for computation are used only in 
SELECT and are not passed on to other processors. 


2 ^ 

7 


The SELECT processor also allows the analyst to evaluate a 
given set of channels using one of three different separability 
measures or to select the best set of channels (k) out of the 
total channels (n) based on one of these separability measures. 
The three separability measures provided are: 

a. Weighted average interclass divergence 

b. Weighted average transformed divergence 

c. Weighted average Bhattacharyya distance 

To select the best set of k out of n channels, the analyst may 
use either the Without Replacement Procedure or the Exhaustive 
Search Procedure. A third procedure will find k linear combina 
tions of n measurements which extremize a given separability 
measure. This procedure, known as the Davidon-Fletcher-Powell 
Procedure, outputs the linear combinations in matrix form. All 
the procedures and equations for separability measures referred 
to above are discussed in detail in reference 5. 

After the SELECT processor has determined the subset or the 
linear combination of channels which maximizes subclass separ- 
ability, the supervised classification of the imagery data is 
performed by the CLASSIFY processor. The options available in 
SELECT for grouping and selecting subsets of the SAVTAP file 
are also available in CLASSIFY. 

However, once the statistics for classification have been speci 
fied, the classes and subclasses are renumbered and referred to 
in the DISPLAY processor by the new numbers. 

The CLASSIFY processor allows the user to group classes previ- 
ously defined by the STAT or ISOCLS processor into categories 
for the sum-of-densities classification. When a category is 


defined (by class names) , all subclasses in each class are 
assigned to the category. The density function for category i, 

P ± (x) , is the sum of densities for all subclasses in the cate- 
gory; that is, 

k i 

p i ( x) = 2 P j (x) (2-2) 

j=l 

where 

Pj = the probability density function for subclass j 
= the number of subclasses in category i 
(Note: More detailed equations are given in section 11.) 

Pixel x is assigned to category i if P^ (x) > P^ (x) for all 
categories k ^ i. Pixel x is further assigned to subclass j 
if (1) j belongs to category i and (2) Pj (x) > p^fx) for all 
subclasses k, k ^ j, and k belongs to category i. 

Obviously a one-to-one correspondence between categories and 
subclasses reduces the above equation to: 

P i (x) = Pj (x) (2-3) 

When this is the case, the amount of computation required for 
classification can be greatly reduced by the use of thresholds. 
CLASSIFY then has two procedures for classification which use 
this computational reduction to advantage. The sum-of-densities 
rule is used only when categories are defined by the user. 
Otherwise, the classification-by-thresholding procedure detailed 
in reference 5 is used. 

The CLASSIFY processor writes a file (MAPTAP, section 4.4 and 
appendix C) containing the subclass number and confidence level 
for each pixel classified; the training fields and statistics for 



the classes and subclasses actually used in classification; and 
the correspondence between categories, classes, and subclasses. 

The DISPLAY processor accepts the file output by CLASSIFY and 
generates a line-printer map of the classified date"*, along 
with several performance tables. In the map, each subclass has 
a symbol associated with it. A threshold option is provided 
for the analyst to print no symbol (blank) for samples classi- 
fied with a confidence level less than some specified 
threshold value. 

Performance summaries are provided on subclass, class, and 
category levels for pixels within each classified field, 
training field, and test field which are input to the DISPLAY 
processor. The training or test field performance summaries 
may be obtained by fields and/or classes. The DISPLAY proc- 
essor also provides optional output of a classification map 
(MAPUNT) on tape in either Universal or LARSYS III format. 

The data-transformation (DATA-TR) processor allows the analyst 
to use the linear transformation matrix computed by SELECT to 
create a new image data tape (TRFORM) . Since the matrix is 
computed to extremize subclass separability, the k linear 
combinations out of n channels represented by the matrix pro- 
duce better class contrast when the image is displayed; that 
is, on the DAS. In addition, the best linear combination of the 
data can be used to enhance the image. 

The TRFORM tape may be output in either the LARSYS III or 
Universal format. 

The NDHIST processor performs an n-dimensional histogram of 
areas on the MSS data tape (DATAPE) , for which the user wishes 


to create scatter plots. The fields may be histogrammed on a 
class, subclass, or per-field basis. A line-printer summary 
of the fields, the number of data vectors in each field, and 
the number of unique data vectors histogrammed is given. 

Optionally, if a scatter plot of a classified or clustered area 
is requested, a classification or cluster image data tape 
(MAPUNT) from the DISPLAY or ISOCLS processor must be input 
to NDHIST. If this option is exercised, the field or fields 
input to this processor and their order of input must be the 
same as those input to CLASSIFY or ISOCLS. 

Information such as the field, cluster or subclass number, the 
frequency of occurrence, and color code for each histogrammed 
radiance vector is written on the n-dimensional histogram 
(NHSTUN) file. 

The SCTRPL processor reads the NHSTUN file, and a two-axis 
color-coded spectral plot (SCTRUN) is output in the Universal 
format. The background for the plot may be black or white. 

If more than two channels were histogrammed by the NDHIST 
processor, the data vector is reduced to two components by 

y' = Ax + b (2-4) 

where 

y' = transformed image 
A = matrix 

x = data vector 

( 

b = bias vector 


// 


Columns 


Definition 


11-15 

16-20 

21-25 

26-30 


Type/format 

Integer/15 

Integar/I5 

Integer/15 

Integer/15 


Sample number of first vertex 
Line number of first vertex 
Sample number of second vertex 
Line number of second vertex 


76-80 Integer/15 Line number of the seventh vertex 

• Card type 6 — Class names, nine names per card, left 

justified in field: The number of cards is determined by 

the number of classes. 

Columns Type/format Definition 


11-16 

Alphanumeric/ 

Four-character 

class 

name 

for 

first 


A4 

class 





19-24 

Alphanumeric/ 

Four-character 

class 

name 

for 

second 


A4 

class 





27-32 

Alphanumeric/' 

Four-character 

class 

name 

for 

third 

• 

A4 

• 

class 

t 




• 

• 

75-80 

• 

• 

Alphanumeric/ 

• 

• 

Four-character class 

name 

for 



A4 

ninth class 






• Card type 7 — Number of subclasses in each class, 24 per card: 
The number of cards is determined by the number of subclasses. 


Columns 

Type/format 



Definition 


9-10 

Integer/12 

Number 

of 

subclasses 

in 

first 

class 

12-13 

Integer/12 

Number 

of 

subclasses 

in 

second 

class 

15-16 

• 

Integer/12 

• 

Number 

of 

subclasses 

• 

in 

third 

class 

• 

78-79 

• 

Integer /I 2 

Number 

of 

• 

subclasses 

in 

24th class 
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Card type 8 — Subclass names, 10 per card: The number of 

cards is determined by the number of subclasses. 

Definition 


Columns Type/format 

9-14 Alphanumeric/ 
A4 

16-21 Alphanumeric/ 
A4 

23-28 Alphanumeric/ 
A4 


Four-character subclass name for 
first subclass 

Four-character subclass name for 
second subclass 

Four-character subclass name for 
third subclass 


72-77 


Alphanumeric/ Four-character subclass name for 
A4 10th subclass 


To complete the set of statistics for one subclass, the following 
three types of cards are grouped together. The number of sets of 
cards is determined by the number of subclasses. 

• Card type 9 — Number of points in this subclass. 

Columns Type/format Definition 

13-20 Integer/18 Number of points in this subclass 

• Card type 10 — Mean vector for this subclass, five values 

per card: The number of cards is determined by the number 

of channels. 


Columns 

Type/ format 



6-20 

Real/El 5. 8 

Mean 

for 

21-35 

Real/El 5. 8 

Mean 

for 



subclass 

36-50 

Real/E15 . 8 

Mean 

for 


Definition 

first channel for this subclass 
second channel for this 

third channel for this subclass 
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Columns Type/format Definition 

51-65 Real/El,5.8 Mean for fourth channel for this 

subclass 

66-80 Real/El5.8 Mean for fifth channel for this subclass 

• Card type 11 — Covariance matrix for this subclass: Only 

the lower triangular portion of the matrix is output; the 
number of values input for this matrix is equal to (number 
of channels) x (number of channels + l)/2. Five values are 
written on each card image in the order indicated. 

"l 

2 3 

4 5 6 

7 8 9 10 


Columns Type/format 
6-20 Real/E 15. 8 

21-35 Real/El 5. 8 

36-50 Real/El 5. 8 
51-65 Real/El5. 8 
66-80 Real/El5 . 8 


Definition 
Element 1 of matrix 
Element 2 of matrix 
Element 3 of matrix 
Element 4 of matrix 
Element 5 of matrix 


3 . 1 . 4 . 2 B-Matrix File 

This file is an optional output of the SELECT processor when the 
Davidon-Fletcher-Powell Procedure is used. The file contains a 
transformation matrix which extremizes a given separability meas- 
ure for the subclasses being used. The matrix is optimized 




using the Davidon-Fletcher-Powell Procedure. The linear 
transformation of the original measurements can be used 
in the CLASSIFY, SCTRPL, TRSTAT, or DATA-TR processor to 
reduce the dimensionality of the data and/or statistics. 

The B-matrix deck, or corresponding file, is an optional 
input to SELECT, SCTRPL, and CLASSIFY and a required input 
to DATA-TR and TRSTAT (see A-MATRIX control card in table 14-1) . 
When input to SELECT, the matrix is used to evaluate a speci- 
fic separability measure or it is used as a first guess for 
the Davidon-Fletcher-Powell Procedure, depending on the user's 
request. When input to CLASSIFY, classification is performed 
using the linear transformation. When input to SCTRPL, the 
dimension of the data from the MSS data tape (DATAPE) is 
reduced to two linear combinations. When input to TRSTAT, 
a new file containing the transformed statistics is created 
on SAVTAP. The DATA-TR processor uses the matrix to create 
a new image file with the reduced dimensionality. 

The keyword B-MATRIX on a control card indicates that the 
B-matrix is being input. Since the matrix may be on cards or 
be a disk file, the parameter CARDS or FILE must be specified 
on the same card in columns 11 through 72. The entire file 
is defined below by card types. 

• Card type 1 — The keyword B-MATRIX in columns 1 through 10 
and CARDS or FILE in columns 11 through 72 initialize 
input of the file. 

• Card type 2 — One card of this type. 

Columns Type/format Definition 

6-7 Integer/12 Number of linear combinations. 

13-14 Integer/12 Number of channels. 




Columns Type/format Definition 

18-80 Integer/3012 The remainder of this card lists by 

number the channels for which the 
matrix was computed; e.g., columns 
18 through 19, first channel, etc., 
for a maximum of 30 channels right 
justified in the field. 


• Card type 3 — The number of these cards is determined by the 
size of the matrix. The values are input by column as indi- 
cated below, five values per card. 

B(k,n), k = linear combinations; n = channels 


1 (k + 1) ••• 

2 (k + 2) ••• 

3 (k + 3) ••• 


[nk - (k - *1)1 
[nk - (k - 2)] 
[nk - (k - 3)] 


Columns 

6-20 

21-35 

36-50 

51-65 

66-80 


Type/ format 
Real/El 5. 8 
Real/E 15. 8 
Real/E 15. 8 
Rea 1/El 5 . 8 
Real/El 5. 8 


2k nk 

Definition 

Element 1 of matrix 

Element 2 of matrix 

Element 3 of matrix 

Element 4 of matrix 

Element 5 of matrix 
(Continued on next card) 


3. 1.4. 3 Cluster Means File 

This file is an optional input to the clustering processor 
ISOCLS. It may be used to initialize the clustering process 
by estimating cluster centers (means) . The means can be taken 


from the module STAT file (see section 3.1. 4.1) created by 
either TRSTAT, STAT, ISOCLS, or the user. Means may be input 
for up to 30 channels for each cluster center, and a subset of 
the channels to be used may be indicated on the CHANNELS 
control card. 

The keyword MEANS in the control cards for ISOCLS indicates 
initial cluster means are being input. Since the means may be 
input on cards or from a disk file, the keyword CARDS or FILE 
must be punched on the same card in columns 11 through 72. If 
on cards, CARDS initializes input of the cluster means deck 
which must immediately follow. The format for the entire file 
is indicated below. 

• Card type 1 — Control card keyword MEANS is left justified 
in columns 1 through 5. The keyword CARDS in columns 11 
through 72 initializes input of the card deck. 

• Card type 2 — Number of clusters and channels. 

Columns Type/format Definition 

6-10 Integer/15 Number of initial clusters' for which 

means are provided 

25-30 Integer/15 Number of channels for which means are 

provided 

• Card type 3 — Actual channels used in computation of means. 

Columns Type/format Definition 


6-7 

Integer/12 

Channel 

1 

8-9 

Integer/12 

Channel 

2 

10-11 

• 

Integer/12 

• 

• 

Channel 

• 

• 

3 

• 

64-65 

• 

Integer/12 

• 

Channel 

30 


Z>( 


• Card type 4 — Mean vectors for the initial clusters: These 

cards are in the same format as 'the means cards (card type 10) 
in the module STAT file. The first mean for each cluster 
always begins on a new card. The number of cards depends 
on the number of channels and the number of clusters. 

Five values are placed on each card. 


Columns 

Type/format 

Definition 


6-20 

Real/El 5. 8 

Mean 

for 

channel 

1 

21-35 

Real/El 5. 8 

Mean 

for 

channel 

2 

36-50 

Real/El5 . 8 

Mean 

for 

channel 

3 

51-65 

Real/El 5. 8 

Mean 

for 

channel 

4 

66-80 

Real/E15 . 8 

Mean for channel 5 
(Continued on consecutive 
cards of the same format) 


3.2 MSS IMAGE DATA TAPES 

Every processor except SELECT, LABEL, TRSTAT , and SCTRPL uses 
an MSS data tape (DATAPE) . The tape assignment defaults to 
logical unit 11, but the user may assign any unit available 
by input of the DATAFILE control card. For details, see the 
file assignment chart in section 5 and the control card section 
for each processor. The tape may be in either the LARSYS III 
format or the Universal format. These formats are defined in 
appendixes B and C, respectively. 

The control card DATAFILE allows the user to communicate the 
file number of the MSS data tape (DATAPE) to be processed and 
the logical unit assignment. This is optional input to every 
processor that requires the MSS data tape. The first file of 
the tape will be processed unless otherwise specified by the 
DATAFILE control card. In executing the same and/or different 
processors back to back, the DATAFILE control card may be input 
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only to the first processor executed if the same file and logical 
unit are to be used throughout the execution. For example, 


$HIST 

DATAFILE UNIT=11 ,FILE=2 
: (Other control cards) 

*END 

(Field definition) 

$END 

$GRAYMAP 
CHANNELS 5 , 6 

*END 

(Field definitions) 

$END 


[File 2 of the MSS data tape 
(DATAPE) assigned to unit 11 
is processed by GRAYMAP as 
well as HIST.] 
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4. SYSTEM INTERNAL FILES 


The files described in this section are used internally by the 
system to pass information between processors. It is the user's 
responsibility to assign the necessary files for his particular 
job. In the discussion that follows, the names of the units on 
which the files are written are also used to identify the files. 


4.1 STATISTICS FILE (SAVTAP, UNIT 20) 

This file must be assigned either to disk or to tape, normally 
the former, whenever one or more of the processors STAT, SELECT, 
CLASSIFY, ISOCLS , LABEL, NDHIST, SCTRPL, TRSTAT , or DATA-TR is 
executed. One file is written on this unit for each execution 
of STAT, TRSTAT, or ISOCLS or for input of a module STAT file 
to some other processor. The file contains the same information 
as itemized in section 3. 1.4.1 for the module STAT file. 


Multiple files may be written on a single unit, usually disk 
or tape, and may be accessed by using the STATFILE control card. 
This control card communicates the file number for positioning 
the unit and the logical unit assignment. The first file is 
always assumed unless otherwise specified by the user, and the 
unit assignment assumes logical unit 20 unless otherwise speci- 
fied by the STATFILE control card. In executing several proc- 
essors back to back and in referencing the same file, only one 
STATFILE control card need be submitted. If different file 
numbers are to be referenced during one execution, then the 
file number may be changed from one processor to the next by 
input of the STATFILE control card to each processor. 

For example, 

$STAT 

STATFILE UNIT=20,FILE=2 
(Other control cards) 

*END 

(Class, subclass, and field definitions) 



$END 

$CLASSIFY 

*END 

(Fields to be classified) 

$END 

The STAT processor will write the training statistics for this 
run on file 2 of the SAVTAP file (unit 20) . (The system files, 
their logical units, and assignments are set out in table 4-1.) 
CLASSIFY will use all of the statistics on file 2 of the tape 
for classification. 


The following example shows assignments for back-to-back execu- 
tions of STAT, ISOCLS, and SELECT. 

$STAT 

STATFILE UNIT=20,FILE=2 
(Other control cards) 

*END 

(Class, subclass, and field definitions) 

$END 

$ISOCLS 

STATFILE UNIT=20,FILE=3 
(Other control cards) 

*END 

(Class and field definitions) 

$END 

$SELECT 

STATFILE UNIT=20,FILE=2 

BEST 4 

*END 

$END 

STAT will write on file 2 of unit 20, ISOCLS will write on 
file 3 of the same unit, and SELECT will go back to file 2 
of unit 20 for the statistics computed by STAT. 


4.2 B-MATRIX FILE (BMFILE, UNIT 10) 

The file written to the BMFILE unit contains the transformation 
matrix which corresponds to the B-matrix file (section 3. 1.4. 2). 
The file is an optional input to SELECT, SCTRPL, and CLASSIFY 
and a required input to DATA-TR and TRSTAT. When the card deck 


is input to any of these processors, this file is automatically 
written. The B-matrix is computed by the SELECT processor and 
is automatically output to the file when the Davidon-Fletcher- 
Powell Procedure is executed. 

4.3 ONE-DIMENSIONAL HISTOGRAM FILE (HISFIL, UNIT 13) 

On logical unit 13, the HIST processor creates the HISFIL file, 
which is used by the GRAYMAP processor. 

4.4 CLASSIFICATION MAP FILE (MAPTAP, UNIT 2) 

The MAPTAP file (appendix D) , which is output by CLASSIFY, 
contains the statistics actually used in the classification, the 
training field information, and all of the classification results. 

4.5 N-DIMENSIONAL HISTOGRAM FILE (NHSTUN, UNIT 4) 

The NDHIST processor writes a file to the NHSTUN unit to be used 
as an interface to the SCTRPL processor. The device default 
assignment is unit 4, but the user may assign any available 
unit. The NHSTUN file format is defined in appendix E. In 
earlier documentation of EOD-LARSYS, the file written on NHSTUN, 
unit 4, is referred to as the NDIM file. 


4.6 TRANSFORMED STATISTICS FILE (SAVTAP, UNIT 20) 

The TRSTAT processor writes the transformed statistics on the 
SAVTAP file. (See section 4.1 for further information.) 


4.7 DOT DATA FILE (DOTUNT, UNIT 19) 

The DOTDATA processor writes a multifile unformatted file on 
the DOTUNT unit. The files contain information extracted 
from the MSS data tape, using all or a subset of 209 possible 
grid points (dots) . The file created on the DOTUNT unit is 
an interface for the processors ISOCLS, LABEL, and DISPLAY. 
The format of the dot data file is defined in appendix H. 



5. SYSTEM OUTPUT FILES 


5.1 CLUSTER MAP FILE (MAPUNT, UNIT 16) 

On logical unit 16 the DISPLAY processor optionally outputs a 
multifile data tape (MAPUNT) containing the subclass number 
to which each corresponding pixel was assigned during classi- 
fication by CLASSIFY. Also, on logical unit 16, the ISOCLS 
processor outputs a file containing either the cluster number 
(OPTION CLUSTER control card) or the mean vector to which each 
corresponding pixel was assigned during clustering. A color 
key containing the color code for each cluster is given for 
the mean vectors. The color codes optionally may be ordered 
according to the cluster number or to greenness (OPTIO.N ORDER 
control card). (See section 9.5.3, table 9-1, for ISOCLS 
control cards . ) 

The results of the classification/clustering may be displayed 
on the PMIS or the Bendix 100 DAS. The tape necessary must be 
mounted on a nine-track tape drive compatible with the DAS 
and may be output in either the LARSYS III or Universal format. 
The display may be made without the color keys (appendices B 
and C) or with color keys (see appendix F for tape format) . To 
exercise this option, see FORMAT control card (table 9-1) for 
the ISOCLS and section 12 (table 12-1) for the DISPLAY processor. 


One file is written on the output tape for each field classified 
or clustered. In earlier documentation of EOD-LARSYS , the file 
written to the MAPUNT unit is referred to as MAPFIL. 


^These data are available as input to NDHIST via saven- or 
nine-track tape or disk. 
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5.2 SCATTER PLOT DATA FILE (SCTRUN, UNIT 12) 


The SCTRPL processor outputs two-axis color-coded spectral plots 
on a multifile Universal-formatted tape. The file default 
assignment is unit 12, but the user may assign any available 
unit. (See file assignment chart, table 5-1.) The relevant 
tape format is defined in appendix G. In earlier documentation 
of EOD-LARSYS , the file written to the SCTRUN unit is referred 
to as PLOTAP. 

5.3 TRANSFORMED DATA FILE ( TRFORM , UNIT 14) 

The DATA-TR processor outputs a multifile image file of trans- 
formed data. The image file may be produced in either the 
LARSYS III or Universal format defined in appendixes B- and C, 
respectively. The tape must be assigned to logical unit 14. 




Processor 

| Input 


Unit 

(a) 

File 

Unit 

No. 

Name 

No. 

Name 

DOTDATA 

11 

DATAPE 

Field cards (dots) 

19 

DOTUNT 

ISOCLS 

11 

DATAPE 

Field cards 

20 

SAVTAP 


10 

BMFILE 


16 

MAPUNT 


20 

SAVTAP 





19 

DOTTJNT 




LABEL 

20 

SAVTAP 

Field cards from ISOCLS 

16 

MAPUNT 


19 

DOTUNT 


2 

MAPTAP 


16 

MAPUNT 


20 

SAVTAP 





19 

DOTUNT 

CLASSIFY 

a 

DATAPE 

Field cards 

2 

MAPTAP 


19 

BMFILE 





SI 

SAVTAP 




DISPLAY 

H 

DATAPE 

Field cards — DO/DU or 

16 

MAPUNT 


E9 

DOTUNT 

test fields 




B 

MAPTAP 




SELECT 

20 

SAVTAP 


10 

BMFILE 


10 

BMFILE 




STAT 

11 

DATAPE 

Field cards 

20 

SAVTAP 

DATATR 

11 

DATAPE 

Field cards 

14 

TRFORM 


10 

BMFILE 




TRSTAT 

20 

SAVTAP 

Affine transformation 

20 

SAVTAP 


21 

CRDUNT 




NDHIST 

11 

DATAPE 

Field cards 

n 

NHSTUN 


16 

MAPUNT 


m 


SCTRPL 

□ 

SAVTAP 


12 

SCTRUN 



NHSTUN 





El 

BMFILE 




HIST 

ii 

DATAPE 

Field cards 

13 

HISFIL 

GRAYMAP 

ii 

DATAPE 

Field cards 

i ■ 



13 

HISFIL 


m 



Output 



Conditional or mixed 
cluste?. map for DISPLAY 
Relabeled 
Relabeled 



Histogrammed by class, 
subclass, or field 



a Other logical units are PCHUNT (punch unit, 7) , PRTUNT (printer, 6) , and RANDIO (direct 
access temporary file, 22). Hand-coded units are: RANDIO (22), CRDUNT (21), and 

PRTUNT (6) . 























































































The location on the scatter plot for each vector in the NHSTUN 
file is determined by its radiance values (if only two channels 
were histogrv-mmed) or by two linear combinations of radiance 
values (if more than two channels were histogrammed) . 

The color for the pixel is assigned by 

• Original radiance values 

• Mean value of the subclass or cluster to which the pixel 
was assigned during classification or clustering 

• Mean value of the test or training field from which the 
pixel was extracted 

• User-defined colors 

• Color extraction from a different pass when using 
multiregistered Landsat data. 

Optionally, for pixel color assignment, the SAVTAP file 
created by STAT or ISOCLS may be input. 

Optionally, a line-printer pixel frequency or log of pixel 
frequency (base 2) plot is given. The plot is printed with 
up to 16 different symbols. 

Figure 2-1 is a diagram showing the principal processing 
options and paths in EOD-LARSYS. 





RESULTS UN SATISFACTORY: 

PI. TYPE 2 OOTS, GO TO C 
PI , TYPE l DOTS, GO TO A 
ANALYST AIDS, GO TO D 

RESULTS SATISFACTORY: 

GO TO A, B, C OR END 


DISPLAYS: 

CLUSTER MAP (ISOCLS) 

MIXED OR CONDITIONAL CLUSTER MAP (LABEL) 
CLASSIFICATION MAP (DISPLAY) 

SCATTERPLOT (SCTRPL) 

IMAGE TAPE (DATATR OR DIRECT) 


Kay to decision points I through 12. 

1 a. Compute histograms and print pictorial gray seal* 

map of data from any channel, using HIST and 
GRAYMAP. 

b. Print pictorial gray seal* map only, using GRAYMAP. 

c. Compute training field statistics, write SAVTAP 
file, using STAT. 

2 a. Group training fields into classes, using ISOCLS. 
b. Omit clustering. 

3 a. (Re) label training field statistics, 
b. Omit labeling. 

A a. Determine subset or linear combination of channels 
that maximizes subclass separabi I ity, using SELECT, 
b. Transform training field statistics using TRSTAT . 

5 a. Create new image data tape, applying linear 

transformation matrix computed by SELECT, using 
DATATR. 

b. Perform supervised classification of 1 nage 
using CLASSIFY. 

6 a. Relabel previously labelled statistics file 

SAVTAP, using LABEL. 

b. Proceed to classification, using CLASSIFY. 

c. Use statistics file SAVTAP to provide start; 4 
cluster mean vectors for ISOCLS. 


a. Relabel dots in dot data file using LABEL. 

b. Display dots using DISPLAY. 

C. Cluster image using ISOCLS. 

a. Compute n-dlmenslonal histogram of selected 
data areas, using NDHIST, and -scatter plot, 
using SCTRPL. 

b. Proceed to labeling, using LABEL. 

a. Compute n-dimensional histogram of selected 
data areas, using NDHIST, and scatterpiot, 
using SCTRPL. 

b. Proceed to classification, using CLASSIFY. 

c. Evaluate discriminatory capability of channels, 
using SELECT. 

a. Proceed to classification, using CLASSIFY. 

b. Proceed to classification sumnary, using DISPLAY. 

a. Compute n-d! mens ional histogram of selected 
data areas, using NDHIST, and scatterpiot, 
using SCTRPL. 

b. Display image on display station and print 
resul ts. 

a. Results unsatisfactory (see annotation on 
flowchart) . 

b. Results satisfactory (see annotation on 
flowchart) . 


Figure 2-1. - Concluded. 




ORIGINAL PAGE IS 
OF POOR QUALITY 


ORIGINAL PAGE IS 
of poor qrALrn 


3. SYSTEM INPUT/OUTPUT FORMATS 

3.1 CARD INPUT/OUTPUT 

Card input to and output from the system must be one of the 
types listed below. It should be noted that card image files 
are normally used in remote processing applications. In the 
discussion following, "card image" should be understood 
for "card." 

3.1.1 PROCESSOR CARDS 

Processor cards identify the processor that is to be executed. 
The system monitor routine calls the appropriate processor, 
which initiates the loading of all routines used by the proc- 
essor. The processor card is always a $ symbol followed by 
the processor name and must always be punched left justified 
beginning in column 1. No blanks are allowed. The $ symbol 
and the first five characters are the unique processor identi- 
fication used by the system monitor routine. 

Below is a list of all processor cards recognized by the system, 
along with the section in which each processor is described. 


$HIST 

Section 

6 

$GRAYMAP 

Section 

7 

$STAT 

Section 

8 

$ISOCLS 

Section 

9 

$SELECT 

Section 

10 

$CLASSIFY 

Section 

11 

$DI SPLAY 

Section 

12 

$DATA-TR 

Section 

13 

$TRSTAT 

Section 

14 

$NDHIST 

Section 

15 

$SCTRPL 

Section 

16 



w 


$DOTDATA Section 17 

$ LABEL Section 18 

$EXIT Execution terminates when this card 

is encountered. 

3.1.2 CONTROL CARDS 

Each processor has its own set of control cards which allow the 
user to exercise various options in the particular processor or 
to change the default value assigned to certain parameters in 
the system. These cards must immediately follow a processor 
card. The control cards are identified by a keyword in col- 
umns 1 through 10 of the card. Only the first six characters 
are used for testing. In columns 11 through 72, the parameter 
values or options are indicated. These columns are free form, 
blanks are ignored (unless of legitimate parameter value) , 
and multiparameter values or options are separated by commas . 
Columns 73 through 80 of che card are not used. With the 
exceptions of the FORMAT, *END, $END, and in some cases the 
STATFILE cards, control cards may occur in any order. (The 
STATFILE control card exception is noted in the section for 
the appropriate processor.) If the list of parameter values 
for a given keyword is too long for one card, the remaining 
values can be input on another card with the same keyword. 

(The continuation of a CATEGORY control card is slightly differ- 
ent; see section 11, table 11-1.) In every processor, the *END 
control card indicates the end of a set of control cards, and 
the $END card indicates the end of field definition card input. 

The FORMAT card defines the format of the MSS data tape by a 1 
(Universal) or by a 2 (LARSYS III) in column 11 (or subsequent 
columns) . This card precedes all others in a job setup. The user 
should ensure that all files written in the run are consistent 
in format. 


3.1.3 CLASS, SUBCLASS, AND FIELD DEFINITIONS 

A field is a specific block of data to be extracted from the 
input MSS data tape (DATAPE) and processed. It is defined by 
a sample increment, a line increment, and from 1 to 10 vertices. 
Optionally, the user may associate a name with each field. The 
alphanumeric field description is located in columns 1 through 6. 
In columns 11 through 72, sample and line increments are sepa- 
rated by a comma and enclosed in parentheses. A comma separates 
the increments and each of the following vertices. The vertices 
must be arranged in clockwise order. Sample and line numbers 
which describe a vertex are separated by a comma and enclosed 
in parentheses. The sample number must be given first for 
each vertex. More than one card may be used to describe a 
field. An asterisk occurring after a vertex indicates a 
continuation card is to be read beginning in column 11. A 
vertex must be completed on a card and cannot be split between 
two cards. The numbers which describe the increments and 
vertices must be integers. 


It is the user's responsibility to ascertain that all defined 
fields are within the bounds of the MSS image. In determining 
which pixels belong in a particular field, the EOD-LARSYS 
examines the pixel intercepts of each scan line with each 
side of the field. The pixel intercept X, with the scan line L 
and the side defined by vertices (X 1 ,Y 1 ) and (X 2 ,Y 2 ), is cal- 
culated by the equation: 


X = 


(L - Y x ) (X, - X,) ^ 

(Y 2 1 


(3-1) 


The value of X is computed as a floating-point number; however, 
the actual pixel intercept must be an integer number. Therefore, 
if the fractional part of X is greater than one-half, the pixel 
intercept is the next higher integer number. If the fractional 


n 


4 


part of X is less than one-half, the pixel intercept will be 
the next lower integer number. When the fractional part of 
X is exactly one-half, the integer pixel intercept depends on 
the direction of movement from the point (X^,Y 1 ) to (X 0 ,Y 2 ). 

If Y^ is less than Y 2 , the pixel intercept is the next higher 
integer. If Y^ is greater than Y 2 , the pixel intercept is the 
next lower integer number. 

After all pixel intercepts for a given scan line have been 
determined, the intercepts are taken in pairs and all pixels 
between and including the pair of intercepts are included in 
the field. In the following example for scan line L, all 
pixels between and including P^ and P 4 are included, and all 
pixels between and including P^ and P 4 are included. 



The following three examples describe field definition cards 
and the fields they describe. In example 1, the sample 
and line increments are equal to 1 for field FI, and there 
are four vertices. 
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In example 2, the field F2 has the same vertices as FI; however 
the sample increment is 2 and the line increment is 3. 
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In example 3, the sample and line increments for field W187 are 
equal to 1, and there are six vertices. 

W187 (1,1) , (324,266) , (335,264) , (338,271) , 

(330,272) , (329,269) , (326,269) 
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Except for SELECT, TRSTAT , and Scatter Plot (SCTRPL) , every 
processor accepts the input of field definition cards. Field 
definitions are always input between the *END and the $END 
control cards. In the STAT and ISOCLS processors, fields must 
be associated with a class or subclass name. In the DISPLAY 
processor, fields may be test fields or designated fields. 

In the NDHIST processor, fields are associated with class, 
subclass, test or training, or any user-defined field. 

The fields defined in STAT and ISOCLS are called training fields, 
and the data within these fields are used for computing statis- 
tics. Training fields are grouped into subclasses, and sub- 
classes are further grouped into classes, using the STAT 
processor. In ISOCLS, training fields are grouped into classes, 
and the clustering procedure breaks the class data into sub- 
classes (clusters). To allow for these groupings, cards bearing 
a class name and a subclass name are necessary. 



A class name card has the keyword CLASSNAME beginning in 
column 1 and the alphanumeric name of the class left justified 
in columns 11 through 14 of the card. Blanks should not be 
embedded in the class or subclass names. 

A subclass name card has the keyword SUBCLASS beginning in 
column 1 and the alphanumeric name of the class left justified 
in columns 11 through 14 of the card. 

In STAT, a CLASSNAME card must immediately follow the *END 
control card. The CLASSNAME card is followed by one or more 
SUBCLASS cards, each of which must be followed by one or more 
field definition cards. See the example for STAT (section 8.4.4) . 

In ISOCLS, a CLASSNAME card must immediately follow the *END 
control card. The CLASSNAME card is immediately followed by 
one or more field definition cards. The data from the fields 
associated with a given class name are clustered as one data 
set. The class is broken into subclasses (clusters) which do 
not have field boundaries. So, even though statistics are 
computed on a subclass level, training fields cannot be asso- 
ciated with subclasses in ISOCLS. See the example for ISOCLS 
(section 9.5.4) . 

In DISPLAY, test fields (if input) must be identified by a 
previously defined class or subclass name. When associated 
with classes, a CLASSNAME card should immediately follow the 
*END control card. Test fields for that class should follow 
immediately. When associated with subclasses, a SUBCLASS 
card should immediately follow the *END card, followed by 
the test fields for that subclass . 


*?/ 


Designated fields are the other type of field input to DISPLAY. 
Fields may be DU or DO. For input of designated fields, a card 
with the keyword DESIGNATE beginning in column 1 and the keyword 
OTHER or UNIDENTIFIABLE beginning in column 11 must precede the 
field definition cards. See section 12.4.4 for sample input of 
test and designated fields. 

3.1.4 SPECIAL SYSTEM FILES 

The card image files described in this section are special files 
normally output from one processor to be used at some future 
time for input to another processor. However, if the user can 
obtain all of the information needed for any of the special 
card image files from some other source, such information may 
be input directly to the processor if the formats described in 
this section are followed. 

These files are always referred to in the job setup with the 
control cards for the particular processor. The first card 
image of each file acts as a keyword which initiates the input 
of the file. It is not necessary to input the same file to 
more than one processor in the same run. 

3. 1.4.1 Module STAT File 

The module STAT file is optional output from the STAT, ISOCLS, 
and TRSTAT processors. It contains either the statistics (mean 
vectors and covariance matrices) for all the subclasses input to 
STAT or for clusters computed by ISOCLS or the transformed sta- 
tistics for all subclasses or clusters input to TRSTAT. These 
statistics are needed in the computation of the probability 
density function in CLASSIFY and the computation of separability 
measures in SELECT. 


This file also contains all the training field boundaries, the 
class and subclass numbers to which the training fields belong, 
the class and subclass names, the number of subclasses in each 
class, and the number of points in each subclass or cluster. 

By defining the required training fields in STAT, the user has 
absolute control over the data samples which will define a 
subclass from the MSS data tape (DATAPE) . Every data sample 
occurring in any one of the training fields defined by a par- 
ticular subclass is used in computing the mean vector and 
covariance matrix for that subclass. 


In the clustering processor ISOCLS, the user has no control 
over the specific samples which comprise a cluster. The proc- 
essor determines which data samples are used in computing the 
mean vector and covariance matrix for each cluster. Because 
of the desirability of using these cluster statistics in other 
processors, the ISOCLS processor creates a file in the same 
format as the STAT processor. The file may be punched if 
desired. Training fields are associated with classes rather 
than subclasses. Clusters are given a six-character name. 

The first three characters are the first three characters of 
the class name associated with the cluster, and the last three 
characters are digits. The digits for the subclasses are in 
sequential order. 

When the module STAT file is input to the CLASSIFY or SELECT 
processor, the user may request subsets of the statistics to 
be used for classification or channel selection via the CHANNELS 
and SUBCLASS control cards in both processors. Subclasses are 
numbered as they were input to STAT, and clusters are numbered 
as they were created in ISOCLS. The channels are numbered as 
they occur on the MSS data tape (DATAPE) . To select a subset 
of the statistics in the module STAT file, the user should 
indicate by number the subclasses and/or channels he wishes 



to use. (Unless the user has previous knowledge of the number 
of clusters in the module STAT file, he or she cannot accurately 
select a subset of the clusters when executing ISOCLS back to 
back with another processor.) 

The first card in the module STAT file acts as a control card, 
with the keyword MODULE initializing the input of the remainder 
of the file. The entire file is composed of the card image 
types listed below. All integers should be right justified in 
the specified field, and alphanumeric characters should be 
left justified in the specified field. 

• Card type 1 — Keyword MODULE in columns 1 through 6. 

• Card type 2 — Number of classes, subclasses, channels, 
fields, and vertices for training fields. 

Columns Type/format Definition 


7-10 

Integer/14 

Number of training 
or ISOCLS 

classes from STAT 

19-20 

Integer/12 

Number of training subclasses from 
STAT (clusters from ISOCLS) 

29-30 

Integer/12 

Number of channels 
of statistics 

used in computation 

38-40 

Integer/13 

Number of training 
STAT or ISOCLS 

fields input to 

49-52 

Integer/14 

Number of vertices 
fields 

in all the training 


• Card type 3 — Actual channels used in computation of 
statistics. 

Columns Type/format Definition 

11-12 Integer/12 Channel 1 

Integer/12 Channel 2 



13-14 


Definition 


r 


i 


15-16 Integer/12 Channel 3 

• • • 

• • • 

• • • 

69-70 Integer /I 2 Channel 30 

• Card type 4 — Training field information: The first card of 

the set.^ Names should not exceed four characters. 

Columns Type/format Definition 

1-6 Alphanumeric/ Field name 

A4 

11-12 Integer/12 Number of the class .associated with 

this field 

21-22 Integer/12 Number of the subclass associated 

with this field input to STAT 
(Since ISOCLS associates fields 
with classes, ISOCLS dummies this 
information by setting it equal 
to zero.) 

31-32 Integer/12 Number of vertices for this field, 

including closure point 

• Card type 5 — Vertices for the training field: Up to 

10 vertices plus the closure point are allowable for each 
training field, 7 vertices per card with coordinates 
ordered (sample, line) . The coordinates are listed in a 
clockwise manner, with the coordinate having the smallest 
sample number listed first. ^ \ 


Card types 4 and 5 define a training field. To complete the 
set of information for one training field, one card of type 4 
and one or two cards of type 5 are required. The number of 
card sets is determined by the number of training fields. 



6. ONE-DIMENSIONAL HISTOGRAM PROCESSOR - HIST 


The processor HIST computes individual field histograms and a 
total histogram for all the fields and channels defined by the 
user. An individual statistics report is printed for every 
field histogrammed. The report contains field descriptions, 
data ranges, means, standard deviations, and normalized ranges 
(mean ±3 standard deviations) . 

A cumulative histogram of all the fields is calculated and 
written on an internal file to be read later by the GRAYMAP 
processor. Like the field histograms, a statistics report 
is printed for the combined fields. 

The input DISPLAY control card allows the user to obtain a 
line -printer plot of the histograms. A histogram for each 
channel on the DISPLAY card (described in table 6-1) is dis- 
played for each field, along with a cumulative histogram for 
all the fields. 

6.1 INPUT FILES 

An MSS data tape (DATAPE) must be input. The tape assignment 
defaults to logical unit 11 but, by input of the DATAFILE 
control card, the user may assign any available logical unit. 
(See table 5-1 for file assignments and section 3.2, MSS 
Image Data Tapes, for further information on format.) 

6.2 OUTPUT FILES 

The HIST processor writes a file for the GRAYMAP processor on 
logical unit 13. This file (HISFIL) contains the histogram 
data for each channel requested. 




6 . 3 SCRATCH FILES 


The HIST processor does not require an additional scratch file. 
6.4 CARD INPUT 

The formats for all system card input are defined in section 3.1. 

6.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 

For example, 

$IIIST 

This card directs the system monitor routine to select the HIST 
processor and causes all the routines used by the HIST processor 
to be loaded into the system. 

6.4.2 SPECIAL SYSTEM FILES 

The HIST processor does not use any special input files. 

6.4.3 CONTROL CARDS 

Control cards allow the user to input various options. These 
cards are identified by a keyword left justified in columns 1 
through 10 of the card, with the parameter values or additional 
keywords in columns 11 through 72 (beginning in any column 
after column 10) . These control cards may be in any order, 
but they must be the first cards after the processor card 
$HIST. Table 6-1 lists all available options, along with 
their default values. 


6.4.4 FIELD DEFINITIONS 


Fields to be histogrammed are input immediately following the 
*END control card. The card column format for field defini- 
tions is defined in section 3.1.2. Input of field definition 
cards is terminated by the $END control card. 

6.5 CARD OUTPUT 

This processor does not output any card decks. 

6.6 RESTRICTIONS 

a. The maximum number of channels is 30. 

b. The number of histograms requested to be plotted may be 
limited if internal dimensions are too small for all user 
requests. (For example, if the user requests 30 channels 
to be histogrammed, only 14 of those histograms may be 
plotted; however, all 30 will be histogrammed.) 

This limitation is a function of the number of channels 
requested on the CHANNELS control card. If too many 
channels are indicated on the DISPLAY control card, a 
diagnostic is printed but execution continues. 

c. The DISPLAY card must be a subset of the CHANNELS card. 

d. The data for all channels for one scan line are unpacked 
into an array dimensioned 12 000. If the number of channels 
times [(sample end - sample begin) /sample increment] exceeds 
12 000, a diagnostic message is printed. Sample end is 
reset to fit the dimensions and execution continues. 

6.7 DIAGNOSTIC MESSAGES 

The diagnostic messages and the subroutines in which t'.ey 
appear are as follows. 




6.7.1 SUBROUTINE HISTGM 


i 



The data for all channels for one 
scan line are unpacked into an 
array dimensioned 12 000. If the 
number of channels times [ (sample 
end - sample begin) /sample incre- 
ment] exceeds 12 000, this diag- 
nostic is printed. Sample end is 
reset to fit the dimensions and 
execution continues . 


Explanation 


The user has input more than 
50 fields, and only the first 
50 field descriptions will be 
printed in the "Data Blocks 
Histogrammed" portion of the 
total report; however, all the 
input fields were included in 
the calculations of the "Total 
Histogrammed Statistics." 


>lanation 


Check unit assignment and file 
number . 



) 

t 



b. TOO MANY CHANNELS ARE User requested too many histograms 

BEING HISTOGRAMMED AND to be plotted. The number of 

PLOTTED, NO. OF CHANNELS histograms plotted varies accord- 
WAS RESET TO ing to the number of channels 

histogrammed. 

c. INVALID CARD — IGNORED. Inappropriate or defective card 

read. Make sure cards are 
punched correctly. 

d. XHIGH - XLOW WAS LESS Range of pixel radiances required 

THAN 100. XHIGH WAS to be >_100. 

RESET TO XXX, OR XLOW 
WAS RESET TO XXX. 


e. CHANNEL, 12, IS NOT A A channel on the DISPLAY card is 

SUBSET OF THE CHANNELS not a member of the set of chan- 

GIVEN ON CHANNELS CARD. nels on the CHANNELS card. 

f. BAD SUPERVISOR CONTROL Check spelling of keywords. 

CARD. 



TABLE 6-1.- HIST PROCESSOR OPTIONS AND CONTROL CARDS 


Parameter and 


Keyword 

default values 

Function 

(a) 

(b) 

CHANNELS 

C 1' C 2 ' C 3' * * * ' C k 
k<30 

Default: None 

Channels to be histogrammed, 
C^/Cj/C^ , • • • /C^, should be 
integer numbers separated by 
commas . 

SIZE 

XHIGH=K 

0<K<255 

Default: XHIGH=255 

K is an integer which sets the 

maximum radiance value which 

will be histogrammed. XHIGH 

becomes X„ = „ on the X-axis of 
max 

the histogram plot. c 

SIZE 

XLOW=J 
0<J<XHIGH 
Default: XLOW=0 

J is an integer which sets the 

minimum radiance value which 

will be histogrammed. XLOW 

becomes X . .. on the X-axis of 
mm 

the histogram plot. c 

SIZE 

YSIZ=L 

0<L<f(x) max 
Default: YSIZ=15 

L is an integer which sets the 
height of the Y-axis (number 
of print lines) . Using the 


input YSIZ, the Y-axis scale 
for the histogram plot will be 
determined by the processor to 


be: f (x) +(YSIZ-1)/YSIZ. 

ulaX 


a The keyword must be left justified in card columns 1 through 10. 

The parameter and default values are in card columns 11 through 
72 (beginning in any column past 10) . 

c The difference between XHIGH and XLOW must be at least 100. 



TABLE 6-1.— Continued. 


Keyword 

DISPLAY 


DATAFILE 


HED1 


HED2 


DATE 


Parameter and 
default values 


C 1' C 2' C 3 

k<30 

Default: 




No plots 


UNIT=N,FILE=M 
Default: N=ll ,M=1 


Function 

Channels for which histograms 
will be plotted. C^,C 2 ,C^, 

••*,0^ must be a subset of 
the CHANNELS card. 

N is the Fortran logical unit 
number to which the MSS data 
tape (DATAPE) has been assigned; 

M is the file number for the tape 
to bo processed. For back-to- 
back executions of more than one 
processor, if using the same 
file number, only one DATAFILE 
control card need be submitted. 


Any 60 characters 
beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 


Replaces first header line with 
the indicated characters in the 
parameter field. 


Any 60 characters Replaces second header line 

beginning in column 11 with the indicated characters 
Default: HOUSTON, in the parameter field. 

TEXAS 


Any 12 characters Prints the indicated 12 charac- 

beginning in column 11 ters in the right corner of the 
Default: Current date heading in place of the current 

date. 

Any 60 characters Prints a comment line using the 

beginning in column 11 60 characters found in the 

Default: No comments parameter field, 

printed 


COMMENT 


TABLE 6-1.— Concluded. 


Keyword 

*END 


Parameter and 

default values Function 

Blank Signals the end of the control 

cards . 

Signals the end of all card 
input for the processing 
function. 


$END 


Blank 


7. GRAYMAP PROCESSOR 

The chief purpose of GRAYMAP is to produce alphanumeric pictorial 
printouts of digitized MSS data. Each data sample is eight bits, 
providing 256 possible gray levels for each MSS data channel. 

To allow a meaningful distinction in gray-scale tones, GRAYMAP 
assigns each of the 256 levels to 1 of as many as 16 possible 
symbols. These symbols may be preassigned or arbitrarily 
assigned for each run. The specifications for the bin edges 
for each symbol may be assigned arbitrarily by the user for each 
run or computed from the histogram data in order to result in 
equal activity for each of the symbols. In any case, the data 
are subsequently output in terms of symbols, arid each symbol 
represents a range of data values in which the corresponding 
data points fall. 

7.1 INPUT FILES 

An MSS data tape (DATAPE) must be input to the GRAYMAP processor. 
The tape assignment defaults to logical unit 11; however, by 
input of the DATAFILE control card, the user may assign any 
available logical unit. (See table 5-1 for file assignments 
and section. 3.2, MSS Image Data Tapes, for further information 
on format. ) 

The GRAYMAP processor requires the bin levels to be input on a 
control card or computed from the histograms output by the HIST 
processor on the HISFIL file. When the bin levels are to be 
computed, logical unit 13 may be assigned to either disk or tape 
or allowed to default to disk (no assignment) . If the HIST 
processor has not been executed prior to running GRAYMAP and bin 
levels have not been input, a default histogram of every 10th 
line for 500 lines and every 10th sample for 200 samples is 
computed, and HISFIL is created on logical unit 13. 




Figure 7-1 shows the interaction of the HIST and GRAYMAP 
processors. 


7.2 OUTPUT FILES 

No files are output by the GRAYMAP processor. 

7.3 SCRATCH FILES 

The GRAYMAP processor does not require additional scratch files 

7.4 CARD INPUT 

All system card input formats referred to in this section are 
defined in section 3.1. 

7.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 
For example, 

$ GRAYMAP 

This card directs the system monitor routine to select the 
GRAYMAP processor and initiates loading of routines used 
by GRAYMAP. 

7.4.2 SPECIAL SYSTEM FILES 

None of the special system files is required for this processor 

7.4.3 CONTROL CARDS 

Table 7-1 lists all available options and control cards recog- 
nized by GRAYMAP, along with their default values. 

7.4.4 FIELD DEFINITIONS 

Fields for which gray-scale maps are desired must follow the 
*END control card. See section 3.1.3 for the format of field 


definition cards. Field definition input is terminated by the 
$END control card. 


7.5 CARD OUTPUT 

The GRAYMAP processor produces no card output. 

7.6 RESTRICTIONS 

The system-related restrictions in section 19 apply to this 
processor. 


The maximum number of channels 
number of bin levels is 16. 

7.7 DIAGNOSTIC MESSAGES 

7.7.1 SUBROUTINE PICT 
Message 

a. THE NO. OF CHANNELS FOR 
THIS FIELD HAS BEEN 
REDUCED TO XXX SO ALL 
THE INFORMATION WILL FIT 
ON DISK. 

MAKE ANOTHER RUN TO 
GRAYMAP OTHER CHANNELS. 

b. FIELD TOO LARGE, 
TERMINATING . 

C. YOU HAVE ASKED FOR TOO 
MANY SAMPLES. THE LAST 
SAMPLE IS 


allowed is 30, and the maximum 


Explanation 
Self explanatory. 


Data exceed allocated storage. 

The last sample is reset to the 
last sample on the data tape. 



7.7.2 SUBROUTINE SETUP 6 


4 


TABLE 7-1.- GRAYMAP PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


CHANNELS 


Parameter and 
default values 

(b) Function 

C 1 /C 2' C 3' * * * ' C k Provides pictorial printout 

k£30 for requested channels. 

Default: Gray map for 

all channels on HISFIL 

(created by a previous 

execution of HIST) 


BINLEVEL N^N^N^ • • • ,N k 
k<16 

Default: Histograms 

used to set bin levels 

SYMBOLS S i' S 2' S 3'***' S k 
k<16 

Default: Two sets of 

10 symbols overprinted, 
resulting in one of 
$, X, 9, 0, ,b 


Upper bin edges for gray-scale 
levels with a 'range of 0 to 255 
and a maximum of 16 levels; 
the last bin level should 
always be 255. 

Character set separated by 
commas, with a maximum of 
16 symbols per SYMBOL card. 

If 2 sets are input, the 
second overprints the first. 

The number of symbols input 
on one card determines the 
number of bin levels when 
using the histograms to set 
the levels. Blank is a 
legitimate character. 


a The keyword must be left justified in card columns 1 through 10. 

The parameter and default values are in card columns 11 
through 72 (beginning in any column past 10) . 



TABLE 7-1.— Continued 


Parameter and 

Keyword default values 

DATAFILE UNIT=N , FILE=M 

Default: N=ll ,M=1 


HED1 Any 60 characters 

beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

HED2 Any 60 characters 

beginning in column 11 
Default: HOUSTON, 

TEXAS 

DATE Any 12 characters 

beginning in column 11 
Default: Current date 


Function 

N is the Fortran logical unit 
number to which the image data 
tape has been assigned; M is 
the file number on the tape to 
be processed. For back-to-back 
executions of more than one 
processor, if using the same 
file number, only one DATAFILE 
control card need be submitted. 

Replaces first header- line 
with the indicated characters 
in the parameter field. 

Replaces second header line 
with the indicated characters 
in the parameter field. 

Prints the indicated 12 charac- 
ters in the right corner of the 
heading in place of the current 
date. 


COMMENT 


*END 


Any 60 characters Prints a comment line using 

beginning in column 11 the 60 characters found in 
Default: No comments the parameter field, 

printed 


Blank 


Signals the end of the control 
cards . 
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TABLE 7-1.— Concluded 


Keyword 
$END 




Parameter and 

default values Function 

Blank Signals the end of all card 

input for the processing 
function. 






Figure 7-1.— Functional diagram showing interaction of the 
HIST and GRAYMAP processors. 




SET BIN LEVELS FROM HISTO- 
GRAM SO THAT EACH SYMBOL HAS 
EQUAL ACTIVITY. 


PRINT GRAYMAP FOR USER- 
DEFINED FIELDS. 


1 

f 

f ENTER NEXT 

PROCESSING 

FUNCTION. ) 


■ al PA* a-: a- 

(A P JK QPALii’Y 


Figure 7-1.— Concluded 



8. STATISTICS PROCESSOR - STAT 


The statistics processor STAT computes the multichannel means , 
standard deviations, covariance matrix, and correlation coeffi- 
cient for each training field and all training subclasses which 
are defined through user input to the processor. In addition, 
at the user's option, histograms and spectral plots may be 
computed for each field and/or subclass. 

The STAT processor requires user input of both card images and 
an MSS data tape (DATAPE) . Card image input consists of an 
optional number of cards from the set of control cards defined 
in table 8-1 and the training field definitions described in 
section 3.1.3. The required input MSS data must encompass the 
area of interest specified in the training field definitions. 

The processor is activated and initialized by a specific proc- 
essor card defined in section 8.4.1. All processor functions 
which are available as options to the user are directed by means 
of the input control cards or the built-in defaults for any 
control card which is not input. 

In addition to the optional printouts under the direction of the 
control cards, the STAT processor creates the output file SAVTAP 
which contains the computed statistics (mean vector and covari- 
ance matrix) for each training subclass. The training subclass 
statistics optionally may be output on punched cards (the module 
STAT file) . Both the output statistics file SAVTAP and the 
output module STAT file are in a format acceptable to the statis 
tics input requirements of other processors in the EOD-LARSYS. 
Figure 8-1 gives the functional flow of the STAT processor. 



The mean vector for the i th subclass is computed as follows: 


= x li' x 2i' -, '' X pi'" , ' X Pi 


where 


X . = i— y X . » average value in channel p for subclass i 
pi N. P3 

j-i 

p = channel number. 

P = largest channel number. 

= number of samples in all training fields for subclass i. 

X . = the ith sample of the MSS data for channel p (a value 
PD 

between 0 and 255) . 

u. = mean vector for the i th subclass. 

*1 

The covariance matrix for the ith subclass is computed as 
follows: 


k lli k 12i 


k lPi 


k 21i k 22i 


K 1 - 


k pqi 


k Pli 


k PPi 


where 


k pqi " N. - 1 £ 


- h t h £ 


q = channel number. 


8-^f 
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Closely related statistics are the standard deviation and cor- 
relation coefficient for the i th subclass, which are computed 
as follows: 


°pi " (k PPi ) 


1/2 




w (k .k .)W 

ppi qqi 1 


(8-3) 


where 

k ^ = element of the covariance matrix for subclass i; the 
variance between channels p and q. 

a ^ = standard deviation in channel p for subclass i; - p = q. 

P . = correlation coefficient between channels p and q for 
subclass i. 


8.1 INPUT FILES 

An MSS data tape (DATAPE) must be input to the STAT processor. 
The tape assignment defaults to logical unit 11; however, by 
input of the DATAFILE control card, the user may assign any 
available logical unit. (See table 5-1 for file assignments 
and section 3.2, MSS Image Data Tapes, for further information 
on format.) 

8.2 OUTPUT FILES 

The STAT processor always outputs the statistics on the SAVTAP 
file and, optionally (by means of the OPTION PUNCH control card) 
provides the module STAT file on cards. (See section 3. 1.4.1 
for format of card file.) 
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The required output file SAVTAP will contain the class names, 
subclass names, and the subclass statistics (mean vector and 
covariance matrix) computed by the STAT processor for every 
subclass defined. The output statistics file must be assigned 
to either disk or tape. The tape assignment defaults to logical 
unit 20; however, by input of the STATFILE control card, the 
user may assign any available unit. (See table 5-1 for file 
assignments and table 8-1 for control card description.) 

If the STATFILE control card is used, the statistics from more 
than one execution of STAT may be saved on the same tape. 

8.3 SCRATCH FILES 

The STAT processor does not require the use of a separate 
scratch file. 

8.4 CARD INPUT 

The specific card column formats for the information to be 
input on the processor and control cards are given in sec- 
tions 3.1.1 and 3.1.2. Table 8-1 describes the complete set 
of keywords and option parameters recognized and acted upon 
by the STAT processor. 

If possible, each keyword and its option parameters are to be 
completely contained on one control card. However, if more 
parameters are required than can be contained on one card, the 
control card may be repeated and the parameters continued on 
the next control card. The parameters for a control card of a 
given type will be cumulative over all cards of that type. 

The control cards follow the $STAT processor card. All options 
available on the STAT processor have a default setting which is 
used by the processor for those option parameters not input via 


control card. The control card *END must be input to signify 
the end of the set of control cards. Immediately following the 
*END card, a set or sets of CLASSNAME, SUBCLASS, and training 
field definition cards must be input. See section 8.4.4 for 
further details on training field definitions. 

8.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 

For example, 

$STAT 

This card directs the system monitor routine to select the STAT 
processor and initiates the input of STAT processor control 
card image file. 

8.4.2 SPECIAL SYSTEM FILES 

The processor does not expect input of any of the system- 
generated files described in section 3.1.4. 

8.4.3 CONTROL CARDS 

Table 8-1 gives the complete set of control cards which the user 
may input to direct the STAT processor functions and the default 
functions performed by the processor. With the exception of the 
*END and $END control cards, the sequence of the control cards 
is optional. The *END card must immediately follow the last 
control card, if any; the CLASSNAME, SUBCLASS, and training 
field definition cards must immediately follow the *END card; 
and the $END card must immediately follow the last card of the 
input training field definitions. 



8.4.4 CLASS, SUBCLASS, AND FIELD DEFINITIONS 

All CLASS, SUBCLASS, and field definition cards occur between 
the *END and $END control cards. The formats for these cards 
are given in section 3.1.3. Training fields are grouped into 
statistically similar subclasses, and subclasses are grouped 
further into classes. 

A training class is defined to the processor by one card con- 
taining the keyword CLASSNAME in columns 1 through 9. The 
user-determined alphanumeric name to be assigned to the class 
begins in column 11 and may contain a maximum of four charac- 
ters (through column 14) . At least one CLASSNAME card must 
be input. 

A CLASSNAME card must be followed by at least one subclass 
grouping. A subclass grouping is on a SUBCLASS card followed 
immediately by one or more field definition cards. All fields 
defined by field definition cards following the SUBCLASS card 
will contribute a cumulative sample set from which the training 
subclass statistics will be computed for the named subclass. 

The set of cards — one SUBCLASS card followed by one or more 
field definition cards — generates the statistics for one train- 
ing subclass. The number of sets of SUBCLASS and field defini- 
tion cards is determined by the number of sets of training 
subclass statistics. The number of training fields to be 
defined for one given subclass is not restricted. The following 
example shows the grouping of subclasses into classes. 


$STAT 



• (Control 

cards) 




*END 




Classes Subclasses 

CLASSNAME 

CLASS1 




SUBCLASS 

SUB 11 



1 

l (Fields) 

SUB 11 

(of 

CLASS1) 

1 

SUBCLASS 

SUB 12 



2 

; (Fields) 

SUB 12 

(Of 

CLASS 1) 


CLASSNAME 

CLASS2 




SUBCLASS 

SUB 21 



2 3 

• (Fields) 

SUB 21 

(of 

CLASS2) 


CLASSNAME 

CLASS 3 




SUBCLASS 

SUB 31 



4 

; (Fields) 

SUB 13 

(of 

CLASS 3) 


SUBCLASS 

SUB 32 



3 5 

• (Fields) 

SUB 32 

(of 

CLASS3) 


SUBCLASS 

SUB 33 



6 

; (Fields) 

SUB 33 

(of 

CLASS 3) 


$END 



8 . 5 CARD OUTPUT 


The STAT processor will optionally output on punched cards the 
module STAT file (section 3. 1.4.1). The module STAT file 
contains the training field vertices, the subclass names for 
each training subclass, the subclass numbers assigned to each 
training subclass, the association of training fields to each 
subclass and class, and the computed statistics for each train- 
ing subclass. 

The module STAT file is output by the processor on cards only 
when specified by user input of the OPTION PUNCH control card. 

8.6 RESTRICTIONS 

The system-related restrictions in section 19 apply to the 
STAT processor. 

In addition, a core storage limitation is associated with the 

total storage required by the training subclass statistics and 

the various options (i.e., producing histograms and spectral 

plots) . The upper limit on core storage available for all 

requirements generated by the input to the STAT processor is 

10 600 locations. Each subclass covariance matrix requires 

2 

approximately 1/2 (number of channels) locations; each subclass 
mean vector requires locations equal to the number of channels; 
and each training field requires seven locations. If a large 
number of subclasses, channels, and training fields in combina- 
tion with one or more of the options available by means of the 
OPTION control cards causes the core storage limits to be 
exceeded, the STAT processor prints a diagnostic message request- 
ing the user to decrease options, after which it terminates 
execution (see section 8.7). 


' \s 

•i 


The following formula determines the maximum number of fields 
that can be input for a case without any histograms (eq. 8-4) 
and another case with subclass histograms (eq. 8-5) . 

10 600 - 5N0SPEC + 7MAXSUB + 4 + - . 2M ft^ UB . + 5 N OFEAT + 1 NOFEAT + 40 

NOFLO - ^ 

(8-4) 

where 

NOFLD = number of fields 

NOSPEC = number of subclasses grouped together for spectrogram 
(maximum of 20) 

MAXSUB = maximum number of subclasses 
NOFEAT = number of channels 

( 4 + 2 MAXSUB + 5 \ 

-NOFEAT + l) NOFEAT + 40 + XSIZ 

NOFLD * 


(8-5) 

If fields and subclasses need to be histogrammed, a value for 
XSIZ (NOHIST) + 1 should be added to the numerator of equa- 
tion (8-5) , where 

XSIZ = range of histogram (maximum of 101) 

NOHIST = number of channels histogrammed 

8.7 DIAGNOSTIC MESSAGES 

The diagnostic messages provided by the STAT processor are listed, 
along with probable cause and remedy of the condition which 
prompted the message. During statistical computations, other 
messages also may be output by utility routines common to both 
STAT and other processors in the EOD-LARSYS. See the system- 
related messages in section 6 for additional messages obtained 
from a STAT execution. 



8.7.1 SUBROUTINE SETUPl 


a. 



///// from SUBR. SETUPl - 
BAD CONTROL CARD ENCOUN- 


TERED INPUT CARD IS 

, 'CCCC ••• CCC' 


b. *** STAT/SETUPl — 
ERROR IN OPTION (S) 
REQUESTED - SCAN OF 
OPTION (S) DISCONTINUED 
AT CARD COLUMN XX *** 


C. ERROR ON DATAFILE CARD 
d. ERROR ON STATFILE CARD 


i 


Explanation 


The input card which was read 
has none of the legitimate 
keywords to identify it as a 
recognizable control card. 

The card which caused the 
message is printed out as 
part of the message. Although 
the processor will continue to 
read more control cards, this 
is an indication of an error 
in the deck setup. The deck 
should be checked for proper 
control cards and proper 
sequence of cards . 


An OPTION control card is not 
acceptable to the processor. 
The scan of the options will 
be discontinued by the proc- 
essor, and any options speci- 
fied beyond the erroneous one 
will not be activated for the 
run. The processor continues 
with reading of the next con- 
trol card. (See section 3.1.2 
and table 8-1 for correct 
OPTION control card usage . ) 


Check format and unit 
assignment. 


Check format and unit 
assignment. 



Message 


e. * * *MAXSUB=XX MAX NO. 

OF SUBCLASSES CANNOT BE 
GREATER THAN YY 
MAXSUB SET=YY PROCEEDING 
TO NEXT OPTIONS (S) *** 


f. /////FROM SUB. SETUP 1 

DECREASE OPTIONS ***** 
TERMINATING PROGRAM EXECU- 
TION FROM SUBR. SETUP1***** 


Explanation 

The maximum subclass number 
input on the OPTION MAXSUB 
control card exceeds the maxi- 
mum number of subclasses that 
can be handled by the EOD- 
LARSYS. The processor will 
set the maximum number of 
subclasses, which will apply 
to subclasses read in from 
the input subclass/f ield 
definition deck. 

The STAT processor has run out 
of internal storage to handle 
the combination of the quanti- 
ties of input training fields, 
subclasses, and channels. 
Internal storage is fixed at 
10 600 locations. Each sub- 
class required roughly 
1/2 (number of channels) loca- 
tions for the subclass sta- 
tistics. If histograms or 
spectral plots of subclasses 
and/or fields are requested, 
additional internal storage is 
required. The options speci- 
fied in the run deck (i.e., 
histograms and spectral plots) 
and possibly the quantities 
of subclasses, channels, and 
training fields must be 
decreased or eliminated in 


^<L1 

(*7 


I 



g, CHECK CHANNELS OR CLASS 
NOS. REQUESTED — 

CANNOT BE LESS THAN OR 
EQUAL ZERO, OR GREATER 
THAN 30 

***** TERMINATING program 
EXECUTION FROM SUBR. 

SETUP 1 ***** 


h. ***** STAT/LEARN — MAX. OF 
XX SUBCLASSES EXCEEDED — 
FIRST XX SUBCLASSES USED — 
REMAINDER IGNORED 


8-12 



Explanation 


order to get a successful run 
within the core storage 
limitation. 


If the channel numbers speci- 
fied on a HISTO or CHANNELS 
control card are not integers 
within the range 1 through 30, 
this message results. The 
processor terminates execu- 
tion after printing this 
message. Check the format 
of the applicable processor 
control cards (see section 3.1.2 
and table 8-1) . 

The processor has read the 
maximum allowable number of 
subclass names and training 
fields to be associated with 
each subclass, and the next 
subclass name encountered in 
the training fie" d/subclass 
definition deck caused this 
diagnostic message. The first 
MAXSUB subclasses and associated 
training fields input are com- 
puted and the remainder are 
ignored by the processor. 


Message 


***** STAT/LEARN — MAX. 
OF XX SUBCLASSES EXCEEDED 
— FIRST XX SUBCLASSES 
USED — REMAINDER IGNORED 


Explanation 

The STAT processor has read 
the maximum number of subclass 
names and associated training 
fields from the input training 
field/subclass definition deck. 
The available internal storage 
has been filled, and no further 
training fields can be accepted. 
Training statistics will be 
computed for the subclasses and 
fields which have been read to 
this point, and the remainder 
are ignored by the processor. 



TABLE 8-1.- STAT PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 

CHANNELS 


OPTION 


OPTION 


Parameter and 
default values 
(b) 


N i' N 2' N 3' 




l<k<30 

Default: k=30; unless 

the MSS data tape 
(DATAPE) has exactly 
30 channels , the 
default should not be 
taken . 

PUNCH 

Default: The module 

STAT card file is not 
punched, in which case 
statistics are output 
on the SAVTAP file 
only. 


MAXSUB=N 

Default: MAXSUB=15 


Function 

N's are the integer channel 
numbers used by the processor 
in computing training subclass 
and training field statistics; 
must be from the set of chan- 
nels available on the MSS 
DATAPE file. 

The subclass mean vector and 
covariance matrix for every 
subclass defined by user input 
will be punched on cards in a 
format acceptable as input to 
other processors ir. the system. 
This punched card deck is the 
module STAT file defined in 
section 3. 1.4.1. 

Informs the processor as to the 
maximum number of subclasses 
which will be input. The param- 
eter value is used for dimen- 
sioning purposes and reflects 
the maximum number of available 
computer storage locations being 
utilized for other options 
allowed by the STAT processor. 
This parameter must be set by 


The keyword must be left justified in card columns 1 through 10. 

" > The parameter and default values, are in card columns 11 through 72 (beginning 
in any column past 10) . 
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TABLE 8-1.- Continued. 


Keyword 


OPTION 


OPTION 


OPTION 


Parameter and 
default values 


COVAR 

Default: Statistics 

are not printed. 


COVAR=C 

Default: Statistics 

are not printed. 


COVAR=F 

Default: Statistics 

are not printed. 


Function 

the user if the number of sub- 
classes he is about to define 
will exceed the default. It is 
advisable to use this option 
when a large number of training 
fields are to be processed or 
when histograms have been 
requested. 

The multichannel means, stand- 
ard deviations, and covariance 
matrix (lower triangular por- 
tion) are printed out for each 
training subclass and training 
field defined in the input 
training field definition 
deck. 

The multichannel means, stand- 
ard deviations, and covariance 
matrix (lower triangular por- 
tion) are printed out for each 
training subclass defined in 
the input training field 
definition deck. 

The multichannel means , stand- 
ard deviations, and covariance 
matrix (lower triangular por- 
tion) are printed out for each 
training field defined in the 
input training field definition 
deck. 





TABLE 8-1.— Continued 


Keyword 

OPTION 

HISTO 


OPTION 


OPTION 


Parameter and 

default values Function 


NOCOVAR 


N r N 2 ,N 3 ,...,N k 

l<k<30 

Default: k-30 


No training subclass or train- 
ing field statistics are 
printed out. 

N's are integers which provide 
a list of channel numbers for 
use in the histogram options. 
The channel numbers must be 
from the set designated on the 
CHANNELS control card. Note : 
This control card does not 
initiate the histogram option. 


HIST A histogram showing frequency 

Default: No histograms distribution of pixels (reso- 

lution elements or radiance 
values) is printed out for 
every training field and every 
training subclass defined in 
the input training field 
definition deck. For each 
subclass (or field) , a histo- 
gram is provided for every 
channel designated on the 
HISTO control card. 


HIST=C A histogram printout is pro- 

Default: No histograms vided for every training sub- 

class defined in the input 
training field definition card. 
For each subclass, a histogram 
is provided for every channel 


TABLE 8-1.- Continued 


Keyword 


OPTION 


SIZE 


SIZE 


SIZE 


Parameter and 

default values Function 

designated on the HISTO con- 
trol card. 


HIST=F A histogram printout is pro- 

Default: No histograms vided for every training field 

defined in the input training 
field definition deck. 


XHIGH=K 

0<K<255 

Default: XHIGH=220 


XLOW=L 

<3<L<XHIGH 

Default: XLOW-120 


YSIZ=J 
0<J<f (x) 


max 


Default: YSIZ=14 


XSIZ=K 

Default : XHIGH-XLOW 


K is an integer which sets the 

maximum radiance value which 

will be histogrammed. XHIGH 

becomes x of the X-axis 
max 

of the histogram plot. 

L is an integer which sets the 

minimum radiance value which 

will be histogrammed. XLOW 

becomes X_. of the X-axis of 
mm 

the histogram plot. 

J is an integer which sets the 
number of increments on the 
Y-axis of the histogram plot; 
therefore, it is the height 
(number of print lines) of the 
Y-axis. Using the input YSIZ, 
the processor will determine 
the Y-axis scale for the 
histogram plot to be 
f (x) max + (YSIZ-1) /YSIZ . 

Sets the range which will be 
histogrammed; maximum range 
is 101. 




SIZE 


TABLE 8-1. - Continued 


Keyword 

SPECTRAL 


OPTION 


Parameter and 

default values Function 


M lf M 2 ,M 3 ,M 4 

l^M^O 

Default: 4 subclasses 

per spectral plot; 
subclasses 1, 2, 3 , and 
4 on the first plot; 5, 
6, 7, and 8 on the sec- 
ond plot; etc. 


M's are integers which provide 
a list of from one to four 
subclass numbers for the sub- 
classes which are to be plotted 
on one single composite spec- 
tral plot. The subclass num- 
bers must be obtained from the 
set of subclasses defined in 
the input training field 
definition deck. Subclass 1 
is the first subclass defined 
in the deck, and subsequent 
subclass numbers are obtained 
by sequentially numbering the 
subclasses as they occur in 
the training field definition 
deck. 


SPECTRAL A spectral plot is printed 

Default: Spectral out for every training sub- 

plots for subclasses class and training field 

defined in the input training 
field definition deck. The 
plot consists of the subclass 
(or field) mean radiance value, 
mean standard deviation (a) , 
and mean -a plotted versus the 
channel (spectral band) for 
every channel designated on 
the CHANNELS control card. 
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TABLE 8-1.— Continued. 


Keyword 

OPTION 


OPTION 


SIZE 


DATAFILE 


Parameter and 
default values 

SPECTRAL=C 
Default: Spectral 

plots for subclasses 


SPECTRAL=F 
Default: Spectral 

plots for subclasses 

SPECBAS=I 

0<I<105 

Default: SPECBAS=75 


UNIT=N , FILE=M 
Default: N=ll ,M=1 


Function 

A spectral plot will be 
printed out for every sub- 
class defined in the input 
training field definition 
deck. 

A spectral plot will be 
printed out for every field 
defined in the input training 
field definition deck. 

I is an integer which sets the 
minimum radiance value on the 
Y-axis of the spectral plot 
(i.e., Y m i n ) • processor 

has a fixed Y-axis increment 
(3) and a fixed number of 
Y-axis values (50) . Using 
SPECBAS, the processor deter- 
mines the Y-axis range to be: 

Y min =SPECBAS ' Y max =SPECBAS+150, 
N is the Fortran logical unit 
number to which the MSS data 
tape (DATAPE) has been 
assigned; M is the file number 
on the tape to be processed. 
For back-to-back executions 
of several processors if the 
same file number is used, 
only one DATAFILE control 
card need be input. 
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Figure 8-1.— 


Functional flow chart for the 
STAT processor. 
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Figure 8-1.— Concluded. 
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9. ITERATIVE SELF-ORGANISING CLUSTERING SYSTEM 
PROCESSOR - ISOCLS 

A data set to be clustered by ISOCLS is defined by a class con- 
sisting of one or more fields from the MSS data tape (DATAPE) . 

Any number of classes may be defined and clustered as individual 
data sets with one entry into the ISOCLS processor. The user has 
control over the maximum number of clusters allowed per class via 
the CLUSTERS control card. However, the procedure may find fewer 
clusters than the maximum allowed. If the user plans to use the 
statistics generated from the clusters in later CLASSIFY or SELECT 
runs, he or she must exercise control over the maximum number of 
clusters. The SAVTAP file may contain statistics for up to 
75 clusters (or subclasses) , but only 60 can be used for proc- 
essing at any one time in CLASSIFY or SELECT. (Control cards 
are set out in table 9-1.) 

The clustering procedure used in ISOCLS (ref. 3) is an iterative 
procedure which assigns each MSS data sample to a specific 
cluster by determining the nearest (in absolute distance) cluster 
center and assigning the sample to it. At the end of each 
iteration (i.e., when all samples have been assigned to' a 
cluster) , new cluster centers are defined by computing the 
mean vector for the data samples actually assigned to the 
cluster. After the initial split sequence, the iterative pro- 
cedure terminates when the user-specified sequence of splits and 
combinations is exhausted. See the SEQUEN control card. The 
criteria for splitting or combining a cluster are user-specified 
by the STDMAX and DLMIN control cards. 

After the final iteration, the covariance matrix for each cluster 
is computed and, at the user's option, is printed. All cluster 
statistics for the class are saved on a scratch file until all 


classes have been clustered , at which time the SAVTAP file is 
written. The chaining of clusters for the final map printout 
is performed, if the user has requested the option (see CHAIN 
control card, table 9-1) . Statistics for the chained clusters 
are not computed. 

The processor allows the user to control the amount of line- 
printer output he receives via the KRN and MAP control cards. 

A final map of the clustered data is always output along with a 
statistical summary of the clusters, which includes mean and 
standard deviation vectors, total points assigned to each cluster 
and intercluster distances. 

Optionally the user may (1) input initial cluster centers to 
hasten the clustering process or (2) allow the program to ini- 
tialize the process by assigning all the data to one cluster, 
obtaining the mean and standard deviation, and then splitting. 
Initial means may be input (1) by cards (see control card MEANS 
and Cluster Means Deck, section 3. 1.4. 3) or (2) by the SAVTAP 
file (see control card STATFILE) . Input of the initial means 
causes a scratch file to be written so that the means can be 
used repeatedly. Successive classes may or may not use the 
same means to initialize cluster centers for a new class. The 
control card MEANS allows the user to request cluster centers 
from the last class to be read from the scratch file and used 
as initial centers for a new class. Input of a new set of ini- 
tial means will cause the scratch file to be overwritten with 
new cluster co; bers. 



Several additions have been made ir. support of Procedure 1 require- 
ments. These additions are described in a general sense as follows 

a. Optionally, starting dots (pixels) from the dot data file on 
the DOTUNT unit may be used to begin clustering. 

b. The analyst may identify DO and DU pixel sets (fields) by field 
card input. The pixels in these fields are not included as 
inputs to the clustering algorithm. They are assigned 
special cluster numbers and mean vectors for display 
purposes . 

c. Using a Sun angle correction table, the pixel radiance values 
may be modified. (The correction table is built in.) The radi 
ance value correction applies only for clustering purposes. 

The user may input the Sun angles by cards or request that these 
angles be extracted from the header record of a Universal-formatted 
MSS data tape (ERIPS unload tape) . 

The clustering algorithm embodied in ISOCLS is detailed step by 
step in the following subsections. This entire procedure is 
repeated for each class (or data set) . 

See the functional flow chart for ISOCLS (fig. 9 - 1 ). 


9 . 1 PROCEDURES 

4 

9.1.1 NOTATION DEFINITIONS 


Symbol 

Fortran 

name 


Definition 

CLDij 

CLD ( I , J ) 

Intercluster distance between clusters i 
and j . 

d[X k ,u {l) ] 

DIST 

Distance 
center of 

from the data point k to the 
cluster i. 

DLMIN 

DLMIN 

Threshold 

value for combining clusters. 


TABLE 8-1.— Concluded 


Parameter and 

Keyword default values 

STATFILE UNIT=N,FILE=M 

Default: N=20 ,M=1 


HED1 Any 60 characters 

beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

HED2 Any 60 characters 

beginning in column 11 
Default: HOUSTON, 

TEXAS 


Function 

N is the logical Fortran unit 
number to which the SAVTAP 
file has been assigned; M is 
the file number on the tape 
to be processed. 

Replaces first header line 
with the indicated characters 
in the parameter field. 

Replaces second header line 
with the indicated characters 
in the parameter field. 


DATE Any 12 characters 

beginning in column 11 
Default: Current date 

COMMENT Any 60 characters 

beginning in column 11 
Default: No comments 

printed 

*END Blank 

$END Blank 


Prints the indicated 12 char- 
acters in the right corner of 
the heading in place of the 
current date. 

Prints a comment line using 
the 60 characters found in 
the parameter field. 

Signals the end of the control 
cards . 

Signals the end of all card 
input for the processing 
function. 


Symbol 

Fortran 

name 

Definition 

I STOP 

IS TOP 

Maximum number of initial split 



iterations . 

CHNTHS 

CHNTHS 

Chaining threshold value. 

LNCAT 

LNCAT 

INCAT 

Number of existing clusters at a given 
time. 

N (i) 

N{I) 

DN{I) 

Total number of data points assigned 
to cluster i. 

SEQUEN 

SEQUEN 

User-specified sequence of split and 
combine iterations. 

NMIN 

NMIN 

Minimum number of data points allowed 
per cluster for both the initial split 
iterations and for one through (NOSEQ- 1) 
SEQUEN iterations. 

PMIN 

PMIN 

Minimum of (PMIN+NOFEAT) number of 
data points allowed per cluster for 
the NOSEQtTz SEQUEN iteration. 

NOSEQ 

NOSEQ 

Maximum number of SEQUEN iterations. 

ISEQ 

ISEQ 

Numbe:* of SEQUEN iterations at a given 
time. 

NOFEAT 

NOFEAT 

Number of coordinates (channels) in a 



data vector. 

STDMAX 

STDMAX 

Threshold for splitting clusters. 

x k 

C(I,K) 

Data vector k, C (I,K)= (X^ f Xj , • • • , 
X NOFEA,T^k 


MEANS (J,I) 

Mean of the j th coordinate of the 
i th cluster. 


Symbol 


Fortran 

name 


Definition 


(i) 




(i) 


a : 
3 


(i) 


AVP ( J, I) 


AMN ( J , I) 


STDEV ( J, I) 


Temporary summing variable for the cal- 
culation of the standard deviation of 
the j th coordinate of the ith cluster. 

Summing variable for computation of 
new means. After all data have been 
assigned to clusters on each iteration, 
AMN ( J , I ) is the new mean of the j th 
coordinate of the i th cluster. 

Standard deviation of the j th coordi- 
nate of the ith cluster. 


9.1.2 INITIALIZING THRESHOLD VALUES 

Initialize threshold values for splitting clusters (STDMAX) , 
combining clusters (DLMIN) , and deleting clusters (NMIN and 
PMIN) . Then begin the following iterative procedure. 


9.1.3 ITERATIVE PROCEDURE 

9. 1.3.1 Classify and Calculate New Statistics 

Assign each data point to a cluster and at the same time collect 
the means, standard deviations, and point counts of the newly 
developing clusters. If zero clusters, set i = 1 and go to 
iteration b. If more than zero clusters, go to iteration a. 

a, Assign the data point X k = (X 1 ,X 2< ‘ • • ,* N0FEAT ) k to the 

ith cluster if d£x k ,y^J < d^X k ,y^J for all j ^ i, where 

d£x k ,ii^J is defined as 


N OFEA T 

d[x k ,u (i) ] = 2 

j=l 


x jk " 


(i) 


(9-1) 




b. 


N (i) = 

N (i) + 1 

(9-2) 

c . 

= 

3 

N (i) - 
N (i) 

— m! i} + frnrx., 
3 N (i ) 3 k 

(9-3) 

d. 

'!“■ 

N (i) - 
N (i I 

1 -y ( i ) 4 . 1 

Y j N(i) A 3 k 

(9-4) 

e . 


- M 1 ’! 

| 2 j 1/2 

f ; j = 1, NOFEAT 

(9-5) 


Return to step a and repeat iterations a through e until all 
data points have been classified. 


9. 1.3. 2 Delete Clusters 

For the initial split iterations and one through (NOSEQ-1) 
user-specified SEQUEN iterations, delete all clusters which have 
fewer than NMIN members. For the NOSEQtfc user-specified itera- 
tion (last user-input sequence) , delete all clusters which have 
fewer than. PMIN members. A cluster is deleted simply by removing 
the statistics for that cluster and reducing the number of clus- 
ters (specified by LNCAT) accordingly. 

9. 1.3. 3 Test for Completion 

If this is not the last iteration, proceed to 9. 1.3. 4. If this 
is the last iteration and no clusters were deleted, the procedure 
is finished. If one or more clusters were deleted for having 
less than PMIN members, go back to 9. 1.3.1 and reassign the data 
to the clusters obtained from iteration (NOSEQ-1) . 

9. 1.3. 4 Determine Type of Iteration 

Determine whether this is to be a split iteration or a combine 
iteration and proceed to the appropriate step. 



The sequence of iterations will be as follows: 


SSSS CCSCSC 

ISTOP SEQUEN 
and/or 
PERCENT 

where 

S = split iteration 
C = combine iteration 

The beginning sequence of split iterations is terminated either 

(1) when the standard deviations for the user-input percentage 
of clusters (see PERCENT control card, table 9-1) are less than 
the STDMAX threshold parameter or (2) when ISTOP iterations have 
been reached. At that point, the type of iteration (split or 
combine) and number of iterations (NOSEQ) are determined by the 
SEQUEN parameter. 

The initial split iterations are for the automatic initializa- 
tion of cluster centers in the event they are not input. The 
sequence is shortened considerably if initial cluster centers 
are input. 

9. 1.3. 5 Split Clusters 

A cluster is split along the j th coordinate (1) if the j th 
coordinate has the maximum standard deviation for the cluster, 

(2) if the standard deviation along the j th coordinate is 
greater than the STDMAX threshold parameter, and (3) if the 
cluster has more than 2(NMIN+1) data points. 


If conditions (1) through (3) are met, two new clusters are 
created and the parent cluster is deleted. A cluster is created 
merely by defining its centers (means) for each coordinate. If 
the i th cluster is split in the j th coordinate, the two new 
clusters will have centers at |*y|^ , 


rU 2 l) '** ‘/Hj 1 * 


± a, • • • ,y 


(i) 

NOFEAT 


]■ 




where a will normally be but can be a user-input constant 

(see SEP control card) . On a given split iteration, if the 
maximum number of clusters (CLUSTER) has not been reached, all 
clusters having a standard deviation greater than the STDMAX 
parameter will be split. To ensure that the clusters with the 
largest standard deviations receive the highest priority for 
splitting, when 2 x LNCAT > CLUSTER, the standard deviations 
are ordered along the j th coordinate in descending order. 
Return to 9. 1.3.1 after splitting clusters. 


9. 1.3. 6 Combine Clusters 


Two clusters are combined if the distance between them is less 
than the DLMIN threshold parameter. The distance between 
clusters i and j is calculated as 


CLUij 


/NOFEAT 
\ k"l 



- ^ j) 
c (i V3> 


, 1/2 


(9-6) 


If CLD^ j < DLMIN and CLD^ = MIN (CLD^ ^ ) for all i = 1, LNCAT and 
j = 1, LNCAT for all i ^ j, clusters i and j will be merged to 
form a new cluster L with means 


u 


(L) ^ 
k 


Nii)^ 1 ’ + N(j)u^ j) 
N (i) + N ( j ) 


k = 1, NOFEAT 


(9-7) 


The clusters i and j are deleted. The new cluster L is not 
considered as a candidate for merging with any other cluster 
on the iteration in which it was formed. Return to 9. 1.3.1 
after combining clusters. 


9.1.4 CHAINING 

A final optional step in the clusterin'/ procedure groups all 
clusters which have intercluster distances less than the chain- 
ing threshold (CHNTHS) to form one cluster. The chaining pro- 
cedure was adopted because the minimum variance criterion used 



in the iterative procedure above tends to group the data into 
spherical (or ellipsoidal) groupings with Gaussian distributions. 
This type of grouping is certainly a natural grouping and would 
quite often be completely satisfactory. 

Some natural groupings of the data are odd shaped and cannot be 
approximated by Gaussian distributions. Two examples are given 
in figure 9-2. At the end of the sequence of split and combine 
iterations, groupings of the type in figure 9-2 are likely to be 
separated into subclusters as illustrated in figure 9-3. The 
chaining algorithm will indicate that the subclusters 1, 2, and 
3 (fig. 9-3) could be grouped into one composite cluster; 
likewise, subclusters 4, 5, 6, and 7 could be grouped together 
to form one cluster. 

The algorithm scans the intercluster distance table (CLD) and 
begins a chain with the first appearance of two clusters within 
a distance of CHNTHS units. Once a subcluster is in the chain, 
all clusters which are within CHNTHS units of the subcluster 
are added to the chain. See figure 9-4. 

The statistics (means, standard deviations, and covariance 
matrices) of the clusters resulting from chaining are not cal- 
culated by the program because, in many cases, the chained 
cluster cannot be represented by a Gaussian distribution. 

There are, of course, instances where one can safely combine 
those subclusters that are chained by the program into one com- 
posite (Gaussian) cluster. For example, subclusters 1, 2, and 
3 in figure 9-5 can safely be combined into one final cluster. 
This is indicated by the fact that, pairwise, these three sub- 
clusters are all close to one another. In this case, the fol- 
lowing formulas (ref. 3) can be used iteratively to compute the 
composite statistics. 



Assuming that two clusters (n^,m^,C^) and (n 2 ^2 ,C 2^ are to 

considered as one cluster (n,m,C) , where all n, m, and C are 

the number of points , the mean vectors , and the covariance 

T 

matrices, respectively, and m is l 'ie transpose of m then 
n = n^ + n 2 

m ■ (*r^h + 

° ■ (n^Tnjh + (-T^nj) C 2 + 

. / n 2 \ T T 

+ ( n x V - n 2 j m 2 m 2 " ““ 

9.2 INPUT FILES 

An MSS data tape (DATAPE) must be input to the ISOCLS processor. 
The tape assignment defaults to Fortran unit 11; however, by 
input of the DATAFILE control card, the user may assign any 
available logical unit. (See table 5-1 for file assignments 
and section 3.2, MSS Image Data Tapes , for further informa- 
tion on format.) 

In support of Procedure 1, the following inputs are also required 

a statistics file (SAVTAP) to provide starting cluster mean vec- 

1 

tors; initial cluster "centers" on cards; and the starting dots 
from the DOTUNT file to initialize the cluster processing. 

Format descriptions of these files are included in sections 4.1 
and 3 . 1 . 4 . 3 . 




9.3 OUTPUT FILES 


Statistics are output by ISOCLS to the SAVTAP file (section 4.1) 
The file assignment defaults to logical unit 20; but, by input 
of the STATFILE control card, the user may assign any available 
logical unit. (See table 5-1 for file assignments and STATFILE 
control card, table 9-1, for further information.) 

A cluster map tape (MAPUNT) may be generated optionally for dis- 
playing the results of the clustering on the Bendix 100 or the 
PMIS DAS. The FORMAT control card initiates the option and 
names the desired format of the tape. Logical unit 16 should 
be assigned to a nine-track tape drive when this option is 
exercised (see section 5.1). 

A printout of the cluster results consists of the following data 
items by class: cluster numbers and symbols; cluster mean vec- 

tors (by channel) ; cluster standard deviations by channel; inter 
cluster distances; number of pixels per cluster; number of 
clusters; and cluster map by field for each class. 

9.4 SCRATCH FILES 

The program dynamically assigns random access disk storage for 
scratch files. ISOCLS uses the disk storage for temporary 
storage of cluster statistics, the data to be clustered, and 
the classification of each pixel. 


9.5 CARD INPUT 
9.5.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 
For example, 



$ISOCLS 


This card directs the monitor routine to call the ISOCLS proc- 
essor and causes all routines used by the ISOCLS processor to 
be loaded into the system. 

9.5.2 SYSTEM CARD DECKS 

The processor will read a cluster MEANS deck in the format 
defined in section 3. 1.4. 3. The deck may be used to initial- 
ize cluster centers for the clustering procedure. 

9.5.3 CONTROL CARDS 

Control cards allow the user to input various options. They are 
identified by a keyword that is left justified in columns 1 
through 10 of the card, with parameter values or additional 
keywords in columns 11 through 72. These control cards may be 
in any order, but they must be the first cards after the proc- 
essor card $ISOCLS. Table 9- 1 lists all available options, 
along with their parameter values. 


9.5.4 CLASS AND FIELD DEFINITIONS 

A CLASSNAME card, followed by at least one field definition 
card, must immediately follow the *END control card. The for- 
mats for these cards are defined in section 3.1.3. 

The pixels from all fields for one class are extracted from the 
MSS data tape (DATAPE) and stored on a disk file. The data from 
all fields for one class are clustered as one data set. The 
statistics for all clusters in that class are saved on a scratch 
file, and the next class is clustered. When all classes have 
been clustered, the statistics are written on the SAVTAP file. 
The SETUP routine may be entered after each class to change 
parameter values. The input for definition of classes and 
fields is explained as follows. 



5 


Write the SAVTAP file 
after five classes have 
been clustered. 


WHEAT 1 clustered as 
one data set. 


Enter SETUP routine 
again for new parameter 
values. 


Now write statistics 
file. 


I 

I 


I 

I 


} 


$ISOCLS 

CLASSES 


I • 

j ; (Other control cards) 
*END 

( CLASSNAME WHT1 

) FIELD 1 I 

1 j FIELD 2 

' FIELD3 ♦ 

( CLASSNAME WHT2 

2 FIELD4 I 

( FIELD5 ♦ 

— ♦ $END 

j . 

; (New parameter values) 


*END 

/ CLASSNAME NWH1 

. ) FIELD6 
J i FIELD7 
* FIELD8 

. f CLASSNAME NWH2 

q ( FIELD9 ♦ 

- (CLASSNAME NWH3 

b ( FIELD10 f 

— $END 


$ (Next processor) 


Note that actual names may not exceed four characters. 

ISOCLS recognizes DO/DU fields. All the DO/DU field cards (for 
all classes) must be input before the fields to be clustered. 
These fields must immediately follow the *END card. The 
CLASSNAME card follows the last DO/DU field card. 


Example : 


~9 If DO/DU fields are being defined: 


*END 

DESIGNATED 

OTHER 

DESIGNATED 

UNIDEN 

CLASSNAME 


OTHER 

(1.1) , (1,1) , (40,1) , (40,20) , (1,20) 
UNIDENTIFIABLE 

(1.1) , (5,7) , (8,7) , (8,10) , (5,10) 
WHT 







WHT1 (1,1) , (1,1) , (196,1) , (196,117) , (1,117) 

$END 

• If no DO/DU fields are being defined: 

*END 

CLASSNAME WHT 

WHl (1,1) , (1,1) , (196,1) , (196,117) , (1,117) 

$END 


9.6 CARD OUTPUT 

A module STAT file (see section 3. 1.4.1) may be punched and used 
as an interface between ISOCLS and SELECT or CLASSIFY. This 
option is exercised via the OPTION PUNCH control card. 


9.7 RESTRICTIONS 

The ISOCLS processor uses disk for a temporary scratch file. 
There are approximately "50 000 words of storage available on 
disk. The data to be clustered for one class are stored on this 
file, along with other information. To compute the maximum 
number of pixels per cletss, use the following formula. 


Maximum 

pixels 


(number of T 

/number of' 

2 ,-i 

\ + 3 /number of \ 

( classes [ 

,\ channels , 

) \ channels /J 


number of channels + 1 


(9-9) 

The maximum number of clusters per class is 60, and the maximum 
number of channels is 30. The covariance matrices for all 
clusters in one class must be stored in core at one time. They 
are stored in an array dimensioned 11 5Q0. The following formula 
may be used to see if enough storage is available for the 
covariances. 


11 500 > 


number of 
clusters 


number of 
channels 


( number of 
channels 
2 





(9-10) 



9.8 DIAGNOSTIC MESSAGES 


9.8.1 SUBROUTINE DSTAPE 

Message 

THE NUMBER OF CHANNELS TIMES 
THE NUMBER OF SAMPLES HAS 
EXCEEDED 1,1500. DECREASE THE 
NUMBER OF CHANNELS OR THE 
NUMBER OF SAMPLES. TERMINAT- 
ING RUN FROM DSTAPE. 

9.8.2 SUBROUTINE ISOCLS 

Message 

a. NO. CHANNELS FOR 
STARTING NOT EQUAL THAT 
FOR CLUSTER 

b. DIMENSION LIMITS EXCEEDED 

IN ISOCLS BY . REDUCE 

CHANNELS OR MAX. CLUSTERS. 

C. DIMENSION LIMIT OF FOR 

COVARIANCES EXCEEDED. 

9.8.3 SUBROUTINE PSPLIT 

Message 


Explanation 

Storage available has been 
exceeded. 


Explanation 

The number of channels of 
starting vectors from the STAT 
file must equal the number of 
requested data channels . 

The user has exceeded storage. 
The number of channels or maxi- 
mum clusters per class should 
be reduced. 


Explanation 


ERROR READING DISK-ISTAT=XXXX 


9.8.4 SUBROUTINE RANK 


Message 

THE NUMBER OF CHANNELS ARE NOT 
A MULTIPLE OF 4. THE COLOR 
KEYS WILL BE ORDERED BY 
CLUSTER NUMBER. 


Explanation 

Currently used greenness 
brightness transformations 
require four channels per pass. 


9.8.5 SUBROUTINE RDDATA 
Message 

a. TOO MANY DO OR DU FIELDS. 
THESE IGNORED. 

b. TOO MUCH DATA REQUESTED — 
PIXELS* (CHANNELS +1) 

CANNOT EXCEED XXXXXXXXXX • 

c. STORAGE REQUIRED FOR FIELD 

DEFINITION INFORMATION 
EXCEEDS THE DIMENSION 
LIMIT OF . 

d. END-OF-TAPE REACHED BEFORE 
END-OF-FIELD. 

e. INPUT ERROR - A CLASSNAME 
CARD MUST EE INPUT BEFORE 
A GROUP OF FIELDS. 

f. NO. OF PIXELS TO BE UNPACKED 

PER SCAN EXCEEDS THE DIMEN- 
SION LIMIT OF . 

g. TOO MUCH DATA REQUESTED — 
PIXELS * (CHANNELS + 1) 

CANNOT EXCEED 


Explanation 

There can be up to 10 DO fields 
and 10 DU fields. 

Self-explanatory. 

Reduce the number of fields. 

All vertices, names, and 
rectangular coordinates are 
saved for each field. The user 
has exceeded storage. 

A field has been defined beyond 
the limits of the MSS DATAPE. 

See section 9.5.4 on defining 
classes and fields. 

Decrease the number of channels 
or pixels per scan in the field. 

Disk file will not hold all of 
the data for one class. Reduce 
channels or size of fields. 


JK16 

u 


9.8.6 SUBROUTINE RDMEAN 


Message 

a. MEANS FOR CHANNEL XXXX ARE 
NOT ON FILE — DUMMY VALUES 
WILL BE USED. 

9.8.7 SUBROUTINE SETUP 7 : 
Messag e 

a. ERROR ON CHANNEL CARD. 

b. ERROR ON DATAFILE CARD. 

C. ERROR ON STATFILE CARD. 

d. ERROR ON DOTFIL CARD. 

e. INVALID INPUT CARD 

IGNORED . 


f. CHANNELS CANNOT BE CHANGED 
UNTIL THIS EXECUTION OF 
ISOCLS IS COMPLETED. 


g. NO. OF CLASSES CANNOT 
BE CHANGED UNTIL THIS 
EXECUTION OF ISOCLS IS 
COMPLETED . 


Explanation 
Self-explanatory . 


Explanation 

Check format. 

Check format. 

Check format. 

Check format. 

Check table 9-1 for correct 
spelling of keywords for card 
input and make sure the key- 
word is left justified in the 
field. 

The channels to be used should 
be set in the first set of con- 
trol cards input after the 
ISOCLS card. That set of 
channels will be used for all 
classes. If the user attempts 
to input a CHANNELS card into 
the SETUP routine on a later 
entry, the card will be ignored. 

The number of classes to be 
clustered must be input only 
in the first set of control 
cards input after the ISOCLS 




card. If the user attempts 
to change this parameter , the 
input will be ignored. 

**WARNING** NMIN IS LESS NMIN should be increased to 
THAN NO. OF CHANNELS, greater than total number of 

COVARIANCES? WILL NOT BE channels. 

INVERTIBLE. 




TABLE 9-1.- ISOCLS PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


Parameter and 
default values 
(b) 


CHANNELS DATA=C ,C 2 ,C 3 , • • • ,C k , 

STAT=A 1 ,A 2 /A 3 , • • • ,A k 
k<number of channels 
on SAVTAP< 3 0 
Default: None 


Function 

C ' s are integer channel numbers 
that (1) will be used in clus- 
tering and (2) refer to the MSS 
data tape (DATAPE) . A's are 
integer channel numbers that 
(1) will be the starting vec- 
tors (initial means) , (2) refer 
to the SAVTAP file, and (3) must 
be a subset of the channels on 
the SAVTAP file. The same 
channels must be used through- 
out one execution of ISOCLS. If 
a cluster MEANS card file is in- 
put, the channels on this card 
must be a subset of the chan- 
nels in the MEANS card file. 


OPTION ORDER 

Default: The color 

keys will be ordered 
according to cluster 
numbers . 

OPTION PUNCH=N 

Default: If PUNCH is 

omitted, no cards are 
punched; if N is omit- 
ted, it defaults to 1. 


The color keys on the MAPUNT 
tape will be ordered according 
to greenness. See section 5.1 
for further details of color 
keys . 

Punches the means and covar- 
ance matrix for each cluster 
in the module STAT card file 
format defined in sec- 
tion 3. 1.4.1. N=1 punches mod- 

ule STAT card file; N=2 punches 


a 

The keyword must be left justified in card columns 1 through 10. 
b 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 


XtT 

f /' 7 


TABLE 9-1.— Continued 


Keyword 

OPTION 

SEQUEN 

OPTION 

SYMBOLS 

FORMAT 

FORMAT 

OPTION 


Parameter and 
default values 


STATS 

AA* • »A 
Default: SC 


ERCOMP 


Functio n 

ERIPS interface card file; and 
N=3 punches both card files. 

Prints the covariance matrix 
for each cluster. 

A represents the sequence of S 
and C characters used for iter- 
ation control after the initial 
split sequence. A maximum of 
19 characters -may be input. 

Prints an error criterion for 
each iteration. 


S 1' S 2' S 3' * 

Default: 1,2, •••9, 

A,B, • • • Z , % , # , A/ 

f 

, ; , , ' , comma , period , 
blank , 

UNIVERSAL 
Default: Output 

MAPUNT tape is not 
generated. 

LARSYS 

Default: Output 

MAPUNT tape is not 
generated. 

CLUSTER 

Default: If the 

FORMAT control card 


Symbols used to identify 
clusters in the printout. 


Generates the output cluster 
MAPUNT tape in Universal for- 
mat (see section 5.1 for 
further information) . 

Generates the output cluster 
MAPUNT tape in LARSYS format. 


The output cluster MAPUNT tape 
will contain the cluster number 
to which the corresponding 
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TABLE 9-1.— Continued 


Keyword 


NMIN 


DLMIN 


PMIN 


SEP 


DATAFILE 


Parameter and 
default values 

is input, the output 
cluster MAPUNT tape 
will contain the 
mean vector of the 
cluster to which the 
corresponding pixel 
was assigned. 

N 


X 

Default: 3.2 


Function 

pixel was assigned. When 
selecting this option, the 
FORMAT control card must be 
input also. 


Deletes any cluster with fewer 
than N members on the first 
through next-to-last iteration 
(see section 9.1.1). 

On a combine iteration, com- 
bines any two clusters wh se 
means are closer than X units. 


N 


X 

Default: Maximum of 

the channel standard 
deviations in the 
cluster 


Deletes any cluster with fewer 
than N members on last itera- 
tion (see section 9.1.1). 

When splitting a cluster, 
separates the new clusters by 
a distance of X units. 


UN IT=N , F ILE=M N is the Fortran logical unit 

Default: N=11,M=1 number to which the MSS data 

tape (DATAPE) has been assigned 
M is the file number on the 
tape to be processed. For 
back-to-back executions , if 




TABLE 9-1.— Continued. 


Keyword 


STATFILE 


ISTOP 


PERCENT 


Parameter and 

default values Function 

the same data file is to be 
processed throughout the execu- 
tion, only one DATAFILE card 
need be submitted. 


INPUT/UNIT=N , FILE=M , 
OUTPUT/UNIT=L,FILE=S 
Default: No defaults 

for INPUT; L=20,S=1 
for OUTPUT. 


N is the Fortran logical unit 
number to which the SAVTAP 
file containing the initial 
means has been assigned; M is 
the file number of the tape 
to be processed; L is- the 
Fortran logical unit number 
to which the SAVTAP file con- 
taining the generated statis- 
tics will be output; S is 
the file number on the tape 
for saving the clustered 
statistics. (Input of initial 
means from SAVTAP file is 
illustrated in section 16, 
figure 16-4.) 


N 

Default: 10 


A maximum of N iterations is 
performed in the initial split 
sequence. 


N 

Default: 80 


N, an integer number, is the 
test variable for the percent- 
age of stabilized clusters with 
standard deviations less than 
the threshold parameter STDMAX 
in the initial split iteration 
sequence. 



Keyword 

STDMAX 

CLASSES 

CLUSTERS 

KRN 

MAP 

CHAIN 

SUBCLASS 


TABLE 9-1.- Continued. 


Pa"ameter and 

default values Function 


X 

Default: 4.5 


On a split iteration, splits 
any cluster whose maximum 
standard deviation is greater 
than X units. 


N 

Default: 1 

N 

Default: 60 

N 

Default: 20 

N 

Default: 20 


Number of classes to be clus- 
tered (see section 9.5.4 for 
defining classes) . 

Maximum number of clusters per 
class; N must be <60. 

Prints out a summary of the 
clusters at every N th iteration. 

Prints out a map of the clus- 
tered data along with the sum- 
mary for every Nth iteration. 

A final cluster map is printed 
regardless of this parameter. 


X Chains all clusters within X 

Default: Chaining not units of each other to form 

performed one cluster. Chaining of clus- 

ters affects only the final map 
printout and MAPUNT tape. 


C 1' C 2' C 3' * * * ' C k 
k<60 

Default: All sub- 

classes/clusters on 
SAVTAP file will be 
used in initializing 
the clustering. 


C's are integer subclass 
or cluster numbers that 
(1) will be used in _he ini- 
tial means, (2) refer to the 
SAVTAP file, and (3) must be 
a subset of the subclasses or 
clusters on the SAVTAP file. 
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TABLE 9-1.— Continued 


Keyword 

MODULE 

MEANS 


MEANS 


DOTFILE 


DOTS 


Parameter and 
default values 

Blanks 


CARDS 

Default: Clustering 

procedure is automati- 
cally initialized if 
this deck or MEANS 
file is not input. 

FILE 

Default: Cluster cen- 

ters are automatically 
initialized if this 
card or the MEAN card 
deck is not input. 

INPUT/UNIT=n,FILE=m 
Default: Self- 

initializing starting. 


n l ' n 2 / * * * ' n 60 
Default: Dots will not 

be used for starting 

vectors . 

TAPE 

Default: No sun angle 

correction applied. 


Function 

Initializes the reading of the 
module STAT card file that imme- 
diately follows this card. 

Initializes input of the clus- 
ter MEANS deck defined in 
section 3. 1.4. 3. This deck 
is used to initialize cluster 
centers for the clustering 
procedure . 

Indicates means for initial 
clusters have been input pre- 
viously from cards and stored 
on file. The same initial 
means are to be used again for 
initializing the process for 
a new data set. 

Defines the Fortran unit num- 
ber n and file number m of the 
dot data file (DOTUNT unit) - 
containing the starting 
vectors . 

n^ are integer numbers sepa- 
rated by a comma specifying 
the dots to be used as start- 
ing vectors. 

Sun angles are extracted from 
the ERIPS unload MSS data 
tape . 




SUNANG 


TABLE 9-1.— Continued. 


Parameter and 

Keyword default values 

SUNANG n l' n 2 ' * ’ * ' n j 

n^ are integer 
numbers, j<8. 

Default: No sun 

angle correction 
applied. 

HED1 Any 60 characters 

beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

HED2 Any 60 characters 

beginning in column 11 
Default: HOUSTON, 

TEXAS 

DATE Any 12 characters 

beginning in column 11 
Default: Current date 

COMMENT Any 60 characters begin- 

ning in column 11. 
Default: No comments 

printed. 

*END Blank 


Function 

n^ are the sun angles to be 
used in computing the sun 
angle corrections for use in 
the clustering algorithm. A 
sun angle must be input for 
each set of 4 channels input 
on the CHANNEL control card. 

Replaces first header line 
with the indicated characters 
in the parameter field. 

Replaces second header line 
with the indicated characters 
in the parameter field. 

Prints the indicated 12 charac- 
ters in the right corner of the 
heading in place of the cur- 
rent date. 

Prints a comment line using 
the 60 characters found in 
the parameter field. 

Indicates the end of control 
cards . 
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TABLE 9-1.— Concluded 


Parameter and 

Keyword default values 


Function 


$END Blank Indicates the end of all 

classes to be clustered for 
this set of control cards. 
The SETUP routine will be 
reentered to read new con- 
trol cards for the next 
class until all classes have 
been clustered. 


EOO-LARSYS 
HON l TOR ROUTINE 


PARAMETER and 
DEFAULT VALUES 


PRINT FINAL ‘CLUSTER 
SUMMARY AND HAP 


STORE CLASS DATA 


DETERMINE CLUSTERS FOR THIS 1 
CLASS. COMPUTE MEANS, STAND- ■ 
ARD DEVIATIONS, AND INTER- 
CLUSTER DISTANCES. APPLY SUN 
ANGLE CORRECTION IF REQUESTED: 


PRINTOUT 

REQUESTED 


PRINT COVARIANCE 
MATRICES 


if f ,A v * I'Mll) IS 

1 *.n vf.ny 



BRANCH TO 
ANOTHER PROCESSOR 




FIND CLUSTER CHAINS 


PRINT SUMMARY 




SETUP 

READ PARAMETER VALUES 
AMD SET DEFAULTS 


READ DATA FOR ONE CLASS 
INCLUDING DO/DU 





COMPUTE COVARIANCE 
MATRICES 


Figure 9-1.— Functional flow chart for the ISOCLS processor 
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(a) The boomerang-shaped 
cluster. 


(b) The donut-shaped 
cluster. 


Figure 9-2.— Odd-shaped clusters. 



(a) Subclustering of the 
boomerang-shaped cluster. 


(b) Subclustering of the 
donut-shaped cluster. 


Figure 9-3.— Breaking up of the clusters into subclusters. 




(a) Cluster structure. 
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(b) Intercluster distance table 
Figure 9-4.— Example of chaining 



CHNTHS = 3.2 



Figure 9-5.— Example in which chained subclusters can 
be combined safely into one composite cluster. 
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10. FEATURE SELECTION PROCESSOR - SELECT 


The feature selection processor SELECT provides a means of 
measuring the relative importance of the individual channels 
and obtaining the set of channels which provides the best dis- 
crimination between subclasses. The processor allows the user 
to choose one of the following three criteria for measuring the 
separability of the subclasses for a set of channels or for 
linear combinations of the channels. 

• Weighted average interclass divergence 

• Weighted average transformed divergence 

• Weighted average Bhattacharyya distance 

Either the Exhaustive Search or the Without Replacement Procedure 
can be used with one of the above criteria to select a "best" 
set of channels. The Exhaustive Search Procedure determines 
the best set of k out of n channels by computing the separa- 
bility measure for every possible combination of k channels. 

This results in n!/k! (n - k ) l computations of the separability 
measure. The computer time required for this procedure is pro- 
hibitive for a xarge n. In such cases, the Without Replacement 
Procedure could be used. 

The Without Replacement Procedure determines the best k out of 
n channels in the following manner. First, the single channel 
which extremizes the separability measure is selected. Each of 
the remaining (n - 1) channels are paired with the best single 
channel in selecting the best pair of channels. The best triplet 
is determined by combining the remaining (n - 2) channels with 
the best pair. The process continues until the best set of 
k channels has been selected. The number of times the separa- 
bility measure must be computed is n + (n - 1) + (n + 2) + ••• 

(n - k + 1) . 




A third procedure, the Davidon-Fletcher-Powell Procedure, is a 
powerful iterative descent method for finding a local minimum of 
a function of several variables. The procedure is discussed in 
reference 6. How the procedure applies to the problem of channel 
selection or dimensionality reduction is discussed in reference 7. 
In SELECT, the Davidon-Fletcher-Powell Procedure computes a 
k-by-n linear transformation matrix which extremizes a given 
separability measure. This matrix, referred to as the B-matrix, 
is saved on the BMFILE (section 4.2) and optionally is punched on 
cards (B-Matrix file, section 3. 1.4. 2) for later input to the 
CLASSIFY, SELECT, TRSTAT , SCTRPL, or DATA-TR processors. 

An initial guess for the B-matrix must be provided for the 
Davidon-Fletcher-Powell routines and may be input via the 
B-matrix file on cards or BMFILE. If the initial guess is not 
provided by the user, SELECT will execute the Without Replace- 
ment Procedure first to obtain a best set of channels, which it 
will use to initialize a first-guess B-matrix for the Davidon- 
Fletcher-Powell Procedure. 

In addition to selecting a best set of channels and/or linear 
combinations, the processor will evaluate any one of the three 
separability measures for a specified linear combination of the 
channels. The linear combination must be input via the B-matrix 
deck or the BMFILE if SELECT has been executed previously in the 
same run. This option is the fourth procedure defined under the 
PROCEDURE control card. 

The processor will also evaluate ai of the separability measures 
for specified sets of channels. This request is made using the 
EVALUATE and PROCEDURE control cards. This is the fifth option 
defined under the PROCEDURE control card. 


For Procedure 1 applications, the SELECT processor optionally 
provides an option of an automatic assignment of interclass 
weights. The weights for class^, class^ = 1.0 for i =/ j and 
the weights for class., class- = 0.0 for i = j. The breakdown 

*•’ J 

of class^, class j pairs into the correct set of intersubclass 
pairs is provided by the processor. Information concerning the 
class-subclass association is extracted from the input statis- 
tics file SAVTAP. 

See the functional flow chart for the SELECT processor 
(fig. 10-1) . 

10.1 INPUT FILES 

The SELECT processor requires the statistics output from either 
STAT or ISOCLS. Both STAT and ISOCLS write the SAVTAP file and 
optionally punch the module STAT file on cards (see sec- 
tion 3. 1.4.1 for format) which may be used as input to SELECT. 

10. 2 OUTPUT FILES 

The BMFILE is output by SELECT when the Davidon-Fletcher-Powell 
Procedure is used (see appendix I for sample execution) . The 
file is written on logical unit 10. 

The corresponding B-matrix card file is punched if the OPTION 
PUNCH control card is included in the deck setup. 

10.3 SCRATCH FILES 

Disk files are used as scratch files in SELECT. No assignment 
is necessary. 

10.4 CARD INPUT 

All system card input formats referred to in this section are 
defined in section 3. 

fty 


10.4.1 PROCESSOR CARD 


The processor keyword is left justified starting in column 1. 

For example, 

$SELECT 

This card directs the monitor routine to execute SELECT and 
initiates loading of routines used by SELECT. 

10.4.2 SYSTEM CARD FILES 

The processor will read and process the module STAT file and the 
B-matrix file. 

10.4.3 CONTROL C^RDS 

Table 10-1 lists the control cards which are recognized by 
SELECT. 


10.4.4 FIELD DEFINITIONS 

Field definitions do not apply to the SELECT processor. 

10.5 CARD OUTPUT 

SELECT outputs the B-matrix file on cards as an option (see 
control card OPTION PUNCH) . This is optional output only when 
the Davidon-Fletcher-Powell Procedure is executed. 

10.6 RESTRICTIONS 

The system-related restrictions in section 19 apply to the 
SELECT processor. 

Two large arrays are dimensioned in SELECT and used for the 
variable dimensioning of several smaller arrays. Storage in 




one array is a function of the number of subclasses and channels 
requested. That is, 


/ 

'Number of + \ 


_ 


_ 

Number ofl 

channels I 

Number of 

Number 

1 Number 

3 \ 

channels \ 

2 /J 

subclasses 

of best 

^of best 

)\ 


Number of 
subclasses 


/ Number of \/ Number of + A 1Q 60Q 
\subclasses/ \subclasses / 


( 10 - 1 ) 


Storage requirements in the other array are dependent on the 
procedure and criterion being used. The Davidon-Fletcher-Powell 
Procedure requires much more storage than the other procedures, 
and the weighted average interclass divergence requires more 
storage than the other criteria. Requirements for each are 
as follows. 


10.7 DIAGNOSTIC MESSAGES 

10.7.1 SUBROUTINE AVEDIV 

Message 

a. REDUCED COVARIANCE MATRIX 
FOR CLASS XXX IS NOT 
POSITIVE DEFINITE. 

b. MORE STORAGE NEEDED IN 
SUB. AVEDIV FOR WORK 
ARRAY-WORK SIZE=XXXXXXX . 

10.7.2 SUBROUTINE BHTCHR 

Message 

a. COVAR FOR CLASS XXX IS NOT 
POSITIVE DEFINITE. 


Explanation 

Check subclass/cluster statistics 
for singularity. 

Storage inadequate; adjust 
parameters. 


Explanation 

Check subclass/cluster statistics 
for singularity. 



Message 


b. COVAR FOR SUM OF CLASSES 
XXXX,XXXX IS NOT POSITIVE 
DEF. 

C. NOT ENOUGH WORK AREA AVAIL- 
ABLE IN BHTCHR — IWRKS Z = 
XXXXX . 

10.7.3 SUBROUTINE BSTCHK 

Message 

a. AA0760 "BEST" XXX IS 
GREATER THAN OR EQUAL TO 
NO. OF FEATURES IN 
GIVEN DATA. . . IGNORED. 

b. INVALID EVALUATE 
REQUEST. . . 

10.7.4 SUBROUTINE DAVDN1 

Message 

ERROR ON DISK FILE - 

SUBROUTINE DAVDN1 ISTAT 

= XXX. 

10.7.5 SUBROUTINE DAVDN2 

Message 

MINIMUM IS AT ORIGIN - PROGRAM 

CANNOT CONTINUE. 


Explanation 
Same as above. 


Storage inadequate; adjust 
parameters . 


Explanation 

The channels included in "Best" 
mast be a subset of the input 
channels . 

The channels whose separabili- 
ties are to be evaluated must 
be a subset of total input 
channels . 


Explanation 

Tape hardware read error. 


Explanation 

Davidon-Fletcher-Powell itera- 
tion has reached an unusable 
minimum. 


10.7.6 SUBROUTINE DAVDN3 


Message Explanation 

*DAVDN3* — EITHER SIGYI OR YHY Davidon-Fletcher-Powell itera 
TOO CLOSE TO ZERO TO UPDATE tion cannot continue. 

H MATRIX-- SIGYL=XXXXXXX . 

XXXXXXX, YHY=XXXXXXX . XXXXXXX . 


10.7.7 SUBROUTINE DAVIDN 

Message 

a. , NOT ENOUTH WORK AREA AVAIL- 

ABLE IN DAVIDN — IWRKS Z= 
XXXXXX.. 

b. ERROR ON DISK FILE - 

SUBROUTINE DAVIDN 

LSTAT=XXX . 

10.7.8 SUBROUTINE DIVERG 

Message 

a. COVAR FOR CLASS XXXX IS 
NOT POSITIVE DEFINITE. 

b. NOT ENOUGH WORK AREA 
AVAILABLE IN DIVERG -- 
IWRKS Z =XXXXX . 

10.7.9 SUBROUTINE GTS TAT 

Message 

NOT ENOUGH WORK AREA IN 
GSTAT — IWRKS Z=XXXXX. 


Explanation 

Storage inadequate; adjust 
parameters . 

Read error on disk file. 


Explanation 

Check subclass/cluster 
for singularity. 

Storage inadequate; adjust 
parameters . 


Explanation 

Storage inadequate; adjust 
parameters . 


J s^f 
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10.7.10 SUBROUTINE SELECT 
Message 

a. ERROR IN INPUT CHANNELS. 

b. CORE OVERFLOW IN SUBRAY - 
NN - STORAGE LOCATIONS 
NEEDED FOR THIS PROBLEM. 


C. CORE OVERFLOW IN ARRAY - 
NN*2 - STORAGE LOCATIONS 
NEEDED FOR THIS PROBLEM. 


10.7.11 SUBROUTINE SETUP 4 
Message 

a. ERROR ON STATFILE CARD. 

b. TOO .MANY EVALUATE 
REQUESTS — REMAINDER 
IGNORED . 


C. GROUP CARD IN ERROR - 
IGNORED . 

d. PROGRAM CANNOT PROCESS 
LESS THAN 2 CLASSES. 

e. PROGRAM CANNOT PROCESS 
LESS THAN 2 CLASSES. 

f. INVALID CONTROL CARD - 
IGNORED. 


Explanation 

User might reduce the number of 
subclasses or channels or try 
another procedure. The SUBRAY 
array is used for temporary 
storage in SELECT only. (See 
restrictions, section 10.6.) 

See suggestions for first diag- 
nostic message. The ARRAY array 
is used throughout the system 
for variably dimensioned storage. 


Explanation 
Check format. 

The buffer to hold EVALUATE 
requests is dimensioned 100. 
The number of channels and 
channels to be evaluated for 
each EVALUATE request are 
stored in this array. 

Check format of GROUP option. 

At least two channels must be 
input . 

At least two classes must be 
input . 

Check spelling of keyword. 


Message 


g. CORE NEEDED IN ARRAY FOR 
THIS PROBLEM IS XXXXXX 
WORDS. 

10.7.12 SUBROUTINE TRNDIV 

Message 

a. REDUCED COVARIANCE MATRIX 
FOR CLASS N IS NOT POSI- 
TIVE DEFINITE. 

b. NOT ENOUGH WORK AREA IN 
TRNDIV — IWRKS Z =XXXXX . 

10.7.13 SUBROUTINE WGTCHK 

Message 

SUBCLASS IS NOT AMONG INPUT 

SUBCLASSES - WEIGHT INPUT 

IGNORED . 

10.7.14 SUBROUTINE WGTSCN 

Message 

a. SYNTAX ERROR ON WEIGHT 
CARD - REMAINDER OF CARD 
IGNORED . 

b. WEIGHT BUFFER IS FILLED - 
ONLY XXXXX CLASS NAME 
PAIRS ALLOWED. 


Explanation 

Storage is inadequate^ 
adjust parameters. 


Explanation 

The indicated covariance matrix 
cannot be inverted. 

Storage is inadequate; 
adjust parameters. 

Explanation 

Self-explanatory. 


Explanation 
Self-explanatory . 

Buffer storage is inadequate for 
all class name pairs. 


^ 0-9 

"S 


10.7.15 SUBROUTINE WHRPLC 


i 


)lanation 


The indicated channel tc be 
included is not among the 
input channels. 


lo 
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TABLE 10-1.- SELECT PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


Parameter and 
default values 
(b) 


Function 


BEST N 1' N 2'"* 

Default: None 


SUBCLASSES % ' ,C }n 

k<number of sub- 
classes on SAVTAP<60 
Default: All sub- 

classes on the SAVTAP 
file 


CHANNELS C l' C 2'***' C k 

k<number of channels 
on SAVTAP <30 


Finds the best set of N^, 
Nj,*** channels if proce- 
dure 1 or 2 is indicated. 

If procedure 3 is indicated, 
the best N^, N 2 linear com- 
binations of the channels 
are found. N i' N 2'**’ are 
integers separated by commas. 
A request can be made for a 
maximum of 10 best in one 
call to SELECT. 

Provides for use of only 
subclasses C i/C 2 ,*** sta- 
tistics for computation of 
separability measure; allows 
the user to select a subset 
of the statistics on the 
SAVTAP file for use in com- 
puting the subclass separa- 
bilities. c i' C 2'*‘* are 
integers representing the 
subclass number as it occurs 
in the SAVTAP file. 

Selects the best set of chan- 
nels from those indicated on 
this card. Must be a subset 


a The keyword must be left justified in card columns 1 through 10. 
b 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 


TABLE 10-1.— Continued 


Parameter and 
Keyword default values 

Default: All channels 

on the SAVTAP file 


OPTION STATS 

Default: No statis- 

tics printed 

WEIGHTS C1=XX, (C1,C2)=YY, 
OTHERS* ZZ 

Default: All weights 

set to 1.0 for crite- 
ria 2 and 3. For cri- 
terion 1, weights for 
subclass pair (i,j) 
are W^e” 0 *}/ 16 , 
where D^j is the 
divergence for sub- 
class pair (i, j ) . 


Function 

of the channels for which 
statistics are input via the 
SAVTAP file or module STAT 
card file. c i' C 2'*** are 
integers separated by commas. 

Prints a summary of the sta- 
tistics for the subclasses 
and channels actually used 
in SELECT. 

Sets weights for axl subclass 
pairs of subclasses Cl to XX, 
then sets subclass pair (Cl, 
C2) to YY; sets all other 
subclass pairs to ZZ. Sub- 
class names Cl, C2 , etc., must 
match a subclass name from 
the module STAT card file, 
the SAVTAP file, or a GROUP 
name . c See OPTION-CLSWT for 
constraint. 


c Consider the problem of selecting channels which best separate wheat from 
nonwheat classes, where wheat is divided into subclasses Wl, W2, and W3, 
and nonwheat is divided into the subclasses NW1, NW2, NW3, and NW4. It is 
desirable to set all weights between subclasses in each class to zero, 
whereas wheat/nonwheat class pair weights are set to 1. This can be 
accomplished by the following WEIGHTS control cards; Wl=l. ,W2=1. ,W3=1. , 
(W1,W2) =0. ; and (W1,W3)=0. , (W2,W3)=0. ,OTHERS=0. Wl=l. will set weights 
for the following subclass pairs equal to 1; (W1,NW1) , (W1,NW2) , (W1,NW3) , 

(W1,NW4) , (W1,W2) , (W1,W3) . W2=l. will set the weights for the following 

subclass pairs equal to 1: (W2,NW1) , (W2,NW2) , (W2,NW3) , (W2,NW4) , (W2,W3) , 

(W2,W1) . W3=l. will set weights for the following subclass pairs equal 

to 1; (W3,NW1) , (W3,NW2) , (W3,NW3) , (W3,NW4) , (W3,Wi) , (W3,W2) , (W1,W2)=0. 

resets this subclass pair weight to 0. OTKERS=0. sets all other subclass 
pair weights to 0. 


TABLE 10-i.— Continued 


Keyword 

B-MATRIX 


B-MATRIX 


EVALUATE 


Parameter and 
default values 

CARDS 

Default: None 


FILE 

Default: None 


»c 2 . 

Default ; 


None 


Function 

Indicates that the B-matrix 
card file immediately fol- 
lows? results in the evalua- 
tion of the separability 
measure using the linear 
combinations defined by the 
B-matrix if procedure 4 is 
indicated. If procedure 3 
is indicated, the B-matrix 
will be used as a first 
guess for the Davidon- 
Fletcher-Powell Procedure. 

Indicates that a previous 
execution of SELECT has 
written the BMFILE. Depend- 
ing on the PROCEDURE card, 
the B-matrix on file will be 
used as an initial guess for 
the Davidon-Fletcher-Powell 
Procedure or in evaluating 
the separability measure. 

Evaluates the separability 
measure indicated on the 
CRITERION card for channels 
C 1' C 2'**** The set of 
channels to be evaluated 
must be (1) a subset of the 
channels on CHANNEL card and 
(2) must be on one card. 
Several sets of channels may 




TABLE 10-1.— Continued 


Keyword 


MODULE 


GROUP 


PROCEDURE 


Parameter and 

default values Function 


be input by using more than 
one EVALUATE card. 

Blank Indicates that the module 

STAT card file immediately 
follows. The SAVTAP file 
will be written as this 
card file is read. 


NAME , I , J , • • • 

Default: No grouping? 
individual subclasses 
are used. 


Groups the training sub- 
classes I,J,***, pools their 
statistics, and assigns NAME 
as the group name. NAME may 
be any six characters. Inte- 
gers I,J,*** must correspond 
to the subclasses as they occur 
in the module STAT card 
file or the SAVTAP file. 


N N=l: The Exhaustive Search 

Procedure is used; N=2: The 

Without Replacement Procedure 
is used? N=3: The Davidon- 

Fletcher-Powell Procedure is 
used; N=4 : The user-input 

B-matrix is used to evaluate 
the separability measure; 

W=5 : The Evaluate Chan- 

nels Procedure is used; and 
N=6: Invokes the best 

k of N options. 



I ft 


TABLE 10-1.— Continued 


Keyword 

OPTION 


BSPASS 

NCPASS 

APRIOR 


Parameter and 

default values Function 


CLSWT 

.Default: The weights 

are assigned to 
intersubclass pairs. 


The processor determines the 
class/subclass correspondence 
(after any grouping of sub- 
classes to form new subclasses 
if the GROUP control card is 
used) and assigns a weight of 
1.0 to the subclass pairs 
associated with all inter- 
class pairs. Intraclass sub- 
class pairs are given a weight 
of 0.0. 


(NOTE: The WEIGHTS control 

card remains available to 
allow the user to set weights 
for specific subclass pairs. 

If used, the input subclass 
pair weights override the 
processor set subclass pair 
weights. The WEIGHTS OTHERS 
capability is not available 
when this option is exercised. 
If input, it is ignored by the 
processor . ) 

N N is the number of passes to 

Default: None be included in the best set. 


N 

Default: 4 


N = number of channels per 
pass (acquisition) . 


Default: 
Omit card 


This card sets the switch to 
modify intersubclass weights. 




TABLE 10-1.— Continued. 


Keyword 

CRITERION 


INCLUDE 


'STATFILE 


Parameter and 

default values Function 


N 

Default: N=1 


C 1' C 2' * 

Default: None 


UNIT=N/FILE=M 


The indicated criterion is 
used to measure the separa- 
bility between subclasses. 

N=1 for weighted average 
divergence; N=2 for weighted 
transformed divergence; and 
N=3 for weighted average 
Bhattacharyya distance. 

Includes channels C }/ C 2 '*** 
in the best set; meaningful 
only for the Without Replace- 
ment Procedure. C 'i ,C 2 / *** 
must be a subset of channels 
on CHANNELS card. 

N is the Fortran logical unit 
number to which the SAVTAP 
! file has been assigned; M is 

j 

(1) the file number from 
which the training statistics 
are to be retrieved and 

(2) , if the module STAT deck 
is- input, the file number on 
which the statistics are to 
be stored. If M^l, this 
control card must precede the 
module STAT card file in the 
control card deck setup. 


TABLE 10-1.— Concluded. 


Keyword 

HED1 

HED2 

ICOUNT 

DATE 

COMMENT 

*END* 

$END* 


Parameter and 

default values Function 


Any 60 characters 
beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

Any 60 characters 
beginning in column 11 
Default: HOUSTON, 

TEXAS 

N 

Default: N=300 

Any 12 characters 
beginning in column 11 
Default: Current date 


Replaces first header line 
with the indicated char- 
acters in the parameter field. 

Replaces second header line 
with the indicated charac- 
ters in the parameter field. 

Number of iterations for the 
Davidon-Fletcher-Powell 
Procedure . 

Prints the 12 characters in 
the right corner of the 
heading in place of the 
current date. 


Any 60 characters Prints a comment line using 

beginning in column 11 the 60 characters found in 

Default: No comment the parameter field, 

printed 


Blank 


Signals the end of the 
control cards. 


Blank 


Signals the end of all card 
input for the processing 
function . 




Figure 10-1.— Functional flow chart for the SELECT processor. 
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11. CLASSIFICATION PROCESSOR - CLASSIFY 


The classification processor CLASSIFY classifies the MSS image 
data based on statistics (mean vectors and covariance matrices) 
computed from the training fields. 

11.1 PROCEDURES 

Given the statistics for each subclass of interest, each data 
point within the defined classification field from the MSS data 
tape (DATAPE) is assigned to a subclass by one of two procedures. 

In the first procedure, the user does not define categories in 
his input, and the standard m-class maximum likelihood classifi- 
cation rule is followed. However, to decrease the number of 
times the density function must be computed, the classification- 
by-thresholding procedure proposed by Hallum and Minter (ref. 8) 
and improved upon and implemented by Feiveson (ref. 9) is used. 
The standard classification rule (i.e., when no categories are 
defined by the user) is outlined in section 11.1.1. 

In the second procedure, the user defines categories in his 
input, and the sum-of-normal-densities classification rule is 
followed, as set out in section 11.1.2. 

11.1.1 STANDARD M-CLASS CLASSIFICATION 

Assuming multivariate normal probability density functions and 

using the maximum likelihood classification rule, the data 
T 

vector X = (Xj^Xj ,X g, • • • ,X N ) is assigned to subclass i in the 
following manner. 

The assumed joint probability density function of X, when 
material of type i fills the MSS field of view, is given in 
the following equation. 



( 11 - 1 ) 


Pi(X) = 


p i 


(2u) N / 2 |K i | 1 / 2 


3 -0.5Qi 


where : 

= a priori probability for subclass i 
N = number of channels used for classification 
= covariance matrix for subclass i 
X = data vector (X 1 ,X 2 ,X 3 , • • • ,X N ) 

Q ± = (X - y i ) T K" 1 (X - \i t ) 

y^ = mean vector for subclass i 


Because of the exponential form of p i and because JlnCp^ is a 
monotonically increasing function of p^, for computational pur- 
poses it is convenient to define a new function V\ by 

V i = Jln(p i ) = An(p i ) - j&n (2 tt) - jiln|K i | 

- j(X - y i ) T K7 1 (X - y ± ) (11-2) 

The data vector X is classified as belonging to subclass i if 
V ± > Vj for all i f j, where j = l,2,3,*»*,n and n = number of 
subclasses. 


The number of times the function must be computed may be 
reduced by the use of thresholds; i.e., real numbers y^ 
(independent of X) such that 

v i (X) > y ± j implies V j _(X) > V.. (X)' 

► 

and Vj(X) > y ± j implies V.. (X) > V i (X) 

where i , j = 1 , 2 , 3 , • * • , n and i ^ j . 


(11-3) 




The utility of these thresholds is that, if V. (X) > y . . , V. (X) 

1 J J 

need not be computed. Once the values for have been deter- 
mined, they may be used for each observation vector X. 


11.1.2 SUM-OF -NORMAL-DENSITIES CLASSIFICATION 

Also, assuming multivariate normal probability density functions, 

T 

the category classifier classifies the data vector X = (X^,X 2 / 
X 2 ,***,X N ) to category j and subclass i in the following manner. 


The probability density function for each category j is computed 
by the following equation. 


Pj (X) 



i=l 




T 72 e 


-0.5Qi 


(11-4) 


where 

i = subclass number 
j = category number 

kj = number of subclasses in category j 

p ^ = a priori probability for subclass i in category j 

N = number of channels used for classification 

Having computed the probability density function for all cate 
gories, the data vector X is classified as belonging to cate- 
gory j if p. > p„, where SL = l,2,3,**»,q for all j ^ & and 
3 ^ 

q = number of categories. 


The data vector is classified as belonging to subclass i if the 

probability density function for subclass i in category j is 

such that p.. > p. for all i f m, where m = 1,2 , 3 , • • • ,k . . In 
j i jm j 

the computation of p ^ , if the value of the quadratic form 
is smaller than -88, the computer cannot store the computed 
value of e Qi . Thus, e Qi = 0 for all Q ± < -88. In the case 




of all Pj * 0 for j = 1,2,3, the data point will not be 
classified; it will be assigned to a null subclass. 

When the line printer map of the classified data is displayed, 
each data point is printed with the symbol representing the 
legitimate subclass to which the data point belongs, and the 
null subclass is printed with the blank symbol. Figure 11-1 
gives the functional flow of the CLASSIFY processor. 

11.1.3 PROCEDURE 1 

For Procedure 1 applications, the CLASSIFY processor allows an 
option for obtaining subclass a priori values using subclass 
population data from the input file, SAVTAP, and also allows the 
system to assign the category names using the class names from 
the input statistics file, SAVTAP, as the assigned category 
names . 

Both options are an addition to the usual capability of analyst 
input of a priori probability values at the subclass, class, or 
category level via the APRIORI control card and of a category 
name input via the CATEGORY control card. 

11.2 INPUT FILES 

An MSS data tape (DATAPE) must be input to the CLASSIFY processor. 
The tape assignment defaults to logical unit 11; however, by 
input of the DATAFILE control card, the user may assign any 
available logical unit. (See table 5-1 for file assignments 
and section 3.2, MSS Image Data Tapes , for further information 
on format . ) 

11.3 OUTPUT FILES 

The classification results are output on the MAPTAP file 
(see appendix D) which is assigned to logical unit 2. In the 


rp 


event of card input of- the module STAT file, the statistics will 
be output on the SAVTAP file (see section 4.1). 

11.4 SCRATCH FILES 

The processor requires no scratch file. 

11.5 CARD INPUT 

All required input card types are described below. 

11.5.1 PROCESSOR CARD 

The processor keyword is left justified beginning in column 1; 
the parameter FILE is punched starting in column 11, For example, 

$CLASSIFY FILE=N 

This card directs the system monitor routine to load all routines 
used by the CLASSIFY processor. The parameter FILE=N informs the 
processor to output the current classification results on file N 
of MAPTAP. If no integer file number is specified, the processor 
defaults to file 1 of MAPTAP. 


11.5.2 SPECIAL SYSTEM FILES 

The training statistics may be input by means of the module STAT 
file. The B-transformation matrix may be input by means of the 
B-matrix file. The EOD-LARSYS deck formats are described in sec- 
tion 3.1. The use of card input is an option; normally, card 
image files are used. 


11.5.3 CONTROL CARDS 

The cards described in table 11-1 are the complete set of control 
cards recognized by the CLASSIFY processor. All options available 
to the user are exercised by use of the appropriate processor con- 
trol card. If a default condition is specified, the control card 


/3J> 


is optional. If no default condition is specified, the control 
card is mandatory input. Ordering of the sequence of processor 
control cards is unnecessary, with the exceptions that (1) the 
♦END card must follow the last processor control card, (2) the 
$END card must follow the last field definition card for an area 
to be classified, and (3) the STATFILE control card must precede 
the input of the module STAT file in some cases. See table 11-1. 

11.5.4 FIELD DEFINITIONS 

Areas to be classified are communicated to the classification 
processor by using the field definition data card described in 
section 3.1.3, which contains the scan line and sample coordi- 
nates for the area over which classification is to be performed. 
At least one field definition card must be in the run deck imme- 
diately following the *END control card. As many field defini- 
tion cards as there are areas to be classified may be input. The 
processor will classify each field in the order it is identified, 
will print on the line printer the first 110 samples of the 
classification map, and will print any optional output prescribed 
by the control cards for each field classified. The scan line 
and sample coordinates specified on the field definition card 
must be available on the input MSS data tape (DATAPE) . 

11.6 CARD OUTPUT 

The classification processor outputs no punched cards. 

11.7 RESTRICTIONS 

The system-related restrictions described in section 19, along 
with the following, apply to the CLASSIFY processor. 

The category, class, and subclass relationship is as follows: 

Number of < Number of < Number of < gg (11-5) 

categories - classes - subclasses - 




The size of the B-matrix cannot exceed 450 locations: 

/Number of linear) /Number of chan- \ < ,en (11-6) 

\ combinations /\nels in B-matrix/ - ' ” ' 

The channels used in computing the B-matrix automatically replace 
the channels, if any, on the CHANNELS control card. 

The largest sample number of the classification field minus the 
smallest sample number of the classification field cannot exceed 
1000. 


Beginning with the smallest sample number of the classification 
field, only the next 110 samples are displayed on the line-printer 
map output by CLASSIFY, but the entire classified field is dis- 
played on the line-printer map output by DISPLAY. 

When applying the category classifier option, 12 500 storage 
locations are reserved for the data such that 

Sample end - sample start\ /Number of\ „ „ enn -, s 

\ 2"~1 / \ channels ) - 12 500 (11 “ 7) 


When applying the standard classifier option, the table computed 
for the class-pair thresholding procedure shares this storage of 
12 500 locations reserved for the data such that 


/Number of _ ,\ 

/Number of 
(subclasses *7 

Number of 

.(subclasses “ 

2 

subclasses. 


/ Sample end - sample start \Number of < . « ,- 00 
\ 2+1 /channels 


( 11 - 8 ) 


11.8 DIAGNOSTIC MESSAGES 

The diagnostic messages and the routines in which they appear are 
as follows . 


/?r 


11.8.1 SUBROUTINE CLSFY1 


Message 

a. ***** CLSFY/CLSFY1 

THE COVARIANCE MATRIX FOR 
SUBCLASS NO. XX IS EITHER 
SINGULAR OR NOT POSITIVE 
DEFINITE - THE DETERMI- . 
NANT = XXXX.XXXX 
***** TERMINATING PROGRAM 
EXECUTION ***** 


11.8.2 SUBROUTINE CLSFY2 
Message 

a. WIDTH OF RECTANGULAR 
FIELD SURROUNDING 
CLASSIFICATION FIELD 
CANNOT EXCEED 1000 POINTS. 


Explanation 

The determinant of each sub- 
class covariance matrix is 
checked by CLASSIFY to see 
that it is a positive nonzero 
value. A zero value indicates 
a singular matrix, and a nega- 
tive value i indicates a non- 
positive definite matrix. If 
either condition occurs for any 
subclass covariance matrix to 
be used in classification, the 
processor will stop."*" 


Explanation 

The largest sample of the 
classification field minus the 
smallest sample of the classi- 
fication field cannot exceed 
1000 samples. Reduce amount 
of samples per scan line. 


A probable source of an invalid covariance matrix is a module 
STAT file which has been incorrectly formatted and thus is not 
producing good training class statistics. Another possible 
source is that the SAVTAP file does not contain valid statis- 
tical data. 


Message 


Explanation 


b. AS THE COMPUTER CANNOT Self-explanatory. 

EXPONENTIATE A NUMBER 
SMALLER THAN EXP (-88) , 

XXXXXX PTS WERE NOT CLASSI- 
FIED IN THIS FIELD. 


C. TOO MUCH DATA REQUESTED 


When too much data has been 


requested, (1) for the standard 
classifier, reduce parameters 


so that 

( Number of 
subclasses 

+ Number of 
subclasses 

x /Number of 
Ichannels 


_ ,\ /Number of 
J (subclasses 

+ /Points per\ 
l scan line J 


)•- 


12 500; or 


- 2 ) 


/ 


(2) for category classifier, 
reduce data so that the number 
of points per scan line x num- 
ber of channels < 12 500. 


11.8.3 SUBROUTINE REDIF2 

Message Explanation 

a. ERROR ON CHANNELS CARD. Check job setup and unit 

assignments . 

fa. **** CLSFY/REDIF2 

3-MATRIX INPUT STIPULATED 
BY CONTROL CARD. . . 

****UNABLE TO INTERPRET 

TYPE OF B-MATRIX INPUT 

PROGRAM EXECUTION TERMI- 
NATED FROM REDIF2**** 


Message 


Explanation 


c. ** CLSFY/REDIF2 — B-MATRIX 
INPUT FROM BMFILE - 

BAD INPUT VALUES DETECTED: 
NO. COMBINATIONS 

(BMCOMB) = , NO. 

CHANNELS (BMFEAT) = , 

CHANNEL VECTOR ( BMVEC ) = 

**** TERMINATING PROGRAM 
EXECUTION FROM REDIF2 ***** 

d. *** CLSFY/REDIF2 - BAD CARD 
INPUT ON APRIORI CARD - 
DEFAULT APRIORI PROBABILITY 
VALUES WILL BE USED. 

e. AT LEAST TWO (2) CATEGORIES 
MUST BE ASSIGNED. EXITING 
FROM REDIF2. 

f. **** CLSFY/REDIF2 BAD 

PROCESSOR CONTROL CARD **** 

TERMINATING PROGRAM EXECU- 
TION FROM REDIF2 **** 

11.8.4 SUBROUTINE SETUP 2 
Message 

a. ***** CLSFY/ SETUP 2 ••• 

ERROR CONDITION ON ATTEMPT 
TO POSITION MAPTAP TO FILE 
NO. XX 

*****ERROR STATUS CODE=YY 
ABORTING THE RUN***** 


The input B-MATRIX control card 
is printed out as part of the 
error message. One of the data 
cards following it is incorrectly 
formatted. Check deck setup and 
B-matrix card file. Invalid data 
from the BMFIL has been deleted. 


Check format of APRIORI card . 


In exercising the category 
option, two or more categories 
must be used. 

Check spelling of keyword. 


Explanation 

The CLASSIFY processor attempted 
to position the output classi- 
fication results file (MAPTAP) 
to the file number specified 
on the $CLASSIFY processor card. 
Possibly (1) more files were 
indicated than currently 

ty ■- 


Message 


b. AN ERROR HAS OCCURRED IN 
GROUPING CLASSES INTO 
CATEGORIES. CHECK THE 
FOLLOWING : 

NOT ALL OF THE CLASSES 
HAVE BEEN ASSIGNED TO A 
CATEGORY . 

A CLASS NAME ON THE CATE- 
GORY CARD HAS BEEN MIS- 
SPELLED. CLASS NAMES FROM 

SAVTAP FILE ARE: 

CLASS NAMES FROM CATEGORY 
CARDS ARE: 

C. USER INPUT A PRIORI VALUES 
DO NOT SUM TO 1.0. INPUT 
VALUES WERE NORMALIZED.** 

d. ** ERROR IN A PRIORI CON- 
TROL CARD. USER INPUT 
VALUES IGNORED.** 

e. NO. OF CHANNELS REQUESTED 
FOR DATA TAPE AND NO. OF 
CHANNELS ON STAT FILE MUST 
BE EQUAL. 


Explanation 

existed on the MAPTAP file, 

(2) bad tape if the file is 
assigned to tape, or (3) the 
format of the $CLASSIFY proc- 
essor card is incorrect. 

When an error occurs in group- 
ing classes into categories, 
either one or more class names 

(1) have not been assigned or 

(2) have been misspelled. The 
program lists the class names 
as submitted from the SAVTAP 
file or cards. Check these 
for errors. If neither (1) nor 
(2) is applicable, check the 
module STAT file to assure that 
class names are left justified 
in the field. 

Self-explanatory . 


Check format of APRIORI card. 


Self-explanatory. 


1^-rT" 
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TABLE 11-1.- CLASSIFY PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


SUBCLASS 


CHANNELS 


CATEGORY 


Parameter and 
default values 

(b) Function 


^Kji 60 '* 

i = number of sub- 
classes in training 
statistics 
Default: All the 

training subclasses 
are used. 

STAT=N 1 ,N 2 ,*’* ,N k , 
DATA=M X ,M 2 , • • • ,M k 
k<30 

Default: (1) If exe- 

cuted back to back 
with SELECT, the chan- 
nels selected by the 
SELECT processor are 
used; (2) If a 
B-matrix is input, the 
channels used in com- 
puting the matrix are 
used; (3) Otherwise, 
all channels in the 
training statistics 
are used. 

CATNAM/NAMEj^ ,NAME 2 , • • • 
Default: If no 


K's are integers comprising the 
set of subclass numbers used by 
the processor to classify the 
unknown data points; must be a 
subset of training subclasses 
designated as they occur on the 
SAVTAP file. 

N^ ,N 2 , • • • ,n r are the channel 
numbers (integers) from the 
SAVTAP file to be used in 
classification; M ]/ M 2 ' * * * ,m k 
are the channel numbers 
(integers) from the MSS data 
tape (DATAPE) . The number of 
channels selected from SAVTAP 
and DATAPE must be equal . 


Informs the processor that the 
category classifier option 


a The keyword must be left justified in card columns 1 through 10. 

u 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 


TABLE 11-1.— Continued. 


Keyword 


Parameter and 

default values Function 


categories are 
defined, the M-class 
standard classifier 
is applied. 


will be applied and defines 
one category name (CATNAM) and 
the class names (NAME. 1 s) for 
this category. All subclasses 
for a class are assigned to 
this category. CATNAM and 
NAME^ may be up to four charac- 
ters , and NAME^ must match a 
class name on the SAVTAP file, 

A slash (/) separates the cate- 
gory name from the class name. 


Note : (1) Every class must be 

assigned to a category unless 
the class was eliminated by 
omitting all of its subclasses 
on the SUBCLASS control card; 
(2) At least two categories 
must be defined; (3) Continu- 
ation of the list of class 
names in the category on 
another card is indicated by 
an asterisk after the last 
class name of that card. The 
next card should continue the 
list of class names in col- 
umns 11-72. (See test example 
in section 11.6.) 


FILE 

Default: No cate- 

gories are defined 


Initiates the assigning of the 
category names using the class 
names from the input 


/<// 


CATEGORY 


Keyword 


GROUP 


DATAFILE 


TABLE 11-1.— Continued. 


Parameter and 
default values 

and the standard 
classifier will be 
applied. 

SUBNAM,K 1 ,K 2 , • • • ,K ± 
1<K^<60; i=number of 
training subclasses. 
Default: Subject to 

the SUBCLASS control 
card, each individual 
training subclass is 
used as a possibility 
for unknown data 
sample classification. 


UNIT=N , FILE=M 
Default: N=11,M=1 


Function 

statistics file, SAVTAP, and 
invokes the category 
classifier. 

K^'s are integer subclass num- 
bers taken from the set of 
available training subclasses. 

The processor creates a new 
training subclass by combining 
the statistics of the training 
subclasses listed. The training 
subclasses used are not there- 
after available as individual 
subclass possibilities for an 
unknown data sample. The set 
of training subclasses to be 
used is renumbered by the proc- 
essor to account for the new 
grouped subclass and the train- 
ing subclasses deleted by 
grouping. The revised set of 
training subclasses is used 
for all processor output. 

SUBNAM may be from one to four 
characters and will become the 
name for a new training subclass. 

N is the Fortran logical unit 
number to which the MSS data 
tape (DATAPE) has been assigned; 

M is the file number of the tape 
to be processed. For back-to- 


U^T4 

twl 


TABLE 11-1.— Continued. 


Keyword 


STATFILE 


MODULE 


B-MATRIX 


Parameter and 
default values 


UNIT=N,FILE=M 
Default: N=20,M=1 


Blank 

Default: Training 

subclass statistics 
are read from the 
input file SAVTAP. 


Function 

back executions of several 
processors, if using the same 
file number, only one DATAFILE 
control card need be input . 

N is the Fortran logical unit 
number to which the SAVTAP file 
has been assigned; M is: 

(1) the number of the file to 
be processed or (2) , if the mod- 
ule STAT file is input, the num- 
ber of the file for storing the 
statistics. If M^l, this control 
card must precede the module STAT 
file in the control card deck 
or card image file setup. 

Indicates to the processor that 
the training subclass statis- 
tics will be input on cards. 

The module STAT file must imme- 
diately follow this control 
card. See section 3. 1.4.1 for 
further details. 


CARDS or FILE Informs the processor that the 

Default: No transfor- B-transformation matrix is to 

mation of training be input and applied to the 
subclass covariance training subclass statistics 

matrices prior to classification. If 

FILE is placed in the parameter 
field, the mode of B-matrix 
input will be from BMFILE ; if 

/f'J 


TABLE 11-1.— Continued. 


Keyword 


APRIORI 


Parameter and 

default values Function 

CARDS is specified, the 
B-matrix card deck must imme- 
diately follow this control 
card. The channels which 
were used to derive the B- 
transformation matrix will be 
the channels used by the 
processor in classification. 
(See section 3. 1.4. 2 for 
further details.) 


or 

N*A i ,K*A N+ i / ---,A M 

M<60 

Default: If executing 

the standard classi- 
fier, each subclass is 
given an equal 
a priori value. If 
executing the category 
classifier, each cate- 
gory is given an equal 
a priori value which 
is divided equally 
among the subclasses 
in that category. 


A priori values may be input by 
subclass, class, or category. 

N and K are arbitrary repeti- 
tion factors , and A^ ' s are 
decimal numbers such that 
M 



i=l 


M = number of training sub- 


classes, training classes, or 


categories. If input by class 
or category , the setup routine 
will distribute the a priori 
values among the subclasses in 
the following manner: 

. _ Class a priori values 

By class Number of subclasses 
in that class 


Category a pr'ior'i 

values 

Number of sub- 
classes in that 
category 


•Vi/ 


By category = 


TABLE 11-1.— Continued. 


Keyword 


APRIORI 


OPTION 


HED1 


HED2 


Parameter and 
default values 


FILE 

Default: Subclass 

a priori is not to 
be computed from 
the statistics 
file, SAVTAP. 


Function 

The order in which the A. 's 

1 

are input must be the order 
in which the category, class, 
or subclass was defined. 

The subclass a priori proba- 
bility values are computed 
using subclass or cluster 
point populations from the 
statistics file, SAVTAP. 


STATS 

Default: No training 

subclass statistics 
printout 


Training statistics will be 
printed out for each subclass, 
reflecting the B-transformation, 
if any, and the Cholesky fac- 
torization of the covariance 
matrices . 


Any 60 characters Replaces first header line with 
beginning in column 11 the indicated characters in the 
Default: LYNDON B. parameter field. 

JOHNSON SPACE CENTER 


Any 60 characters Replaces second header line 

beginning in column 11 with the indicated characters 
Default: HOUSTON, in the parameter field. 

TEXAS 


Any 12 characters Prints the indicated 12 charac- 
beginning in column 11 ters in the right corner of the 
Default: Current date heading in place of the current 

date . 




DATE 


TABLE 11-1.— Concluded. 


Keyword 

COMMENT 


*END 

SEND 


Parameter and 

default values Function 

Any 60 characters Prints a comment line using 

beginning in column 11 the 60 characters found in 
Default: No comments the parameter field, 

printed 

Blank Signals the end of the control 

cards. 

Blank Signals the end of card input 

for the processing function. 


_JX£8 

,-Ji 



INTERPRET USER-INPUT OPTIONS. 

SELECT REQUESTED SUBSET OF STATISTICS 
FROM SAVTAP FILE. 

PERFORM THE LINEAR TRANSFORMATION ON 
STATISTICS IF B-MATRIX IS INPUT, 
OBTAIN CHOLESKY DECOMPOSITION OF 
COVARIANCE MATRICES. 

DETERMINE A PRIOR PROBABILITY FOR 
EACH SUBCLASS EITHER FROM USER 
INPUT OR SUBCLASS PIXEL POPULATION. 


Y ya: is 


tr 


V. 



SS \ 

r 


COMPUTE TABLE OF THRESHOLDS 
(V; j) FOR ALL SUBCLASS 

PAIRS 1 AND j 

3 

r 

SET i - 1 OR SUBCLASS TO WHICH 
LAST PIXEL WAS ASSIGNED 

m k 

r 



COMPUTE CENSITY FUNCTION p. 
FOR PIXEL X, SUBCLASS I ' 

m k 

f 
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Figure 11-1.— Functional flow chart for the CLASSIFY processor, 
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12. PERFORMANCE DISPLAY PROCESSOR - DISPLAY 

The DISPLAY processor reads the MAPTAP tape output by CLASSIFY 

and performs the following functions. 

a. Provides a line printer map of each classified field on 
MAPTAP. The training and test fields within the classified 
image are outlined. 

b. Produces classification summaries for each classified field/ 
which gives a breakdown on the number of pixels classified 
into, and the number of pixels thresholded from, each sub- 
class, class, and category. 

c. Produces (optionally) an intensive test site (ITS) classi- 
fication summary for one crop type versus all other crop 
types; the user-specified crop may be a category, class, or 
subclass . 

d. Allows the user to designate fields to be excluded from the 
classification summaries. Fields may be designated "uniden- 
tifiable" or "other." Pixels within the unidentifiable 
fields are counted and are not considered in the classifica- 
tion summaries. Pixels within the designated "other" fields 
are counted as a separate crop type regardless of how they 
were classified. These pixels are included in category 
"other" in the ITS report. (See section 12.4.4 for sample 
input of designated fields.) 

All pixels within the designated areas are printed with the 
pound (#) symbol. 

e. Assigns a pixel to the threshold class if thresholding is 
requested and if > t^, where 

T -1 

= the value of the quadratic form (X - y^) (X - y^) 
as computed by CLASSIFY (section 11.1.1) for subclass i 

y^ = mean vector for subclass i 





* covariance matrix for subclass i 
t^ = threshold value for subclass i 
Allows t^ to be determined in one of four ways : 

• User input — The user inputs the exact threshold value. 

See control cards THRESHOLD and OPTION THRESHOLD VALUE. 

• Chi-square option — The user inputs confidence levels for 
each subclass on the THRESHOLD card and includes the 
OPTION CHI SQUARE. The program obtains the chi-square 
threshold value from an internal chi-square functional 
routine . 

• Empirical option — The user inputs confidence levels for 
each subclass on the THRESHOLD card and includes the OPTION 
EMPIRICAL card. The program determines the empirical dis- 
tribution function for each subclass from the cumulative 
histogram of for correctly classified pixels in the 
ground truth areas (i.e. , training or test fields) , as 
shown in the following example. 

100 
95 

Confidence 
level 


0 

From this curve, the user input of a 95-percent confidence 
level for subclass i would result in a threshold value of 
6 . 0 . 

See reference 10 for more information on the use of 
empirically computed thresholds. 

• Fisher F-distribution option — The user inputs confidence 
levels for each subclass on the THRESHOLD card and includes 



' & 


the OPTION FISHER card. The program obtains the 
F-distribution threshold values from an internal routine. 
If a computational overflow occurs in the routine, the 
threshold value for that subclass is set equal to 999.999. 

q* Produces plots of the empirical distribution function when 
OPTION PLOT is exercised. 

h. Performs (optionally) a four-nearest-neighbors spatial fil- 
tering on the classified image. This algorithm takes into 
consideration that, in many instances, a pixel is most likely 
to be like its nearest neighbors. When the option is exer- 
cised via the OPTION FILTER control card, the four nearest 
neighbors of each pixel are examined. If all the neighbors 
are classified the same and the pixel in question is classi- 
fied differently, then it is assumed that the pixel was 
classified incorrectly and its classification is changed. 

In the following example, the pixel classified as X will be 
changed to C. (See reference 11 for more information on 
this algorithm. ) 

Line 1 C 

2 C X C 

3 C 

i. Outputs (optionally) the classified image onto tape (MAPUNT) 
in either LARSYS III or Universal format via the FORMAT NAME 
control card. 

j. Provides classification performance summaries for ground 
truth areas within the classified image. The following six 
performance summaries are available to the user. The fields 
in these reports are either training fields used in the STAT 
or ISOCLS processor and transmitted to DISPLAY via the MAPTAP 
file or test fields input directly to DISPLAY (see sec- 
tion 12.4.4) . 

• Field by subclass 



• Field by class 

• Field by category 

• Class by subclass 

• Class by class 

• Class by category 

k. In Procedure 1 applications, DISPLAY is able: 

1. To accept the dot data file, DOTUNT. 

2. To provide a dot classification performance summary by 
dot categories which also includes: 

• A tabulation of both the uncorrected proportion and 
the bias corrected proportion of each dot category in 
the total area classified. 

• An "alpha" table which tabulates proportions for each 
labeled category of bias correction dots (type 2 
dots) , which, for each category, are the ratios of 
the number of dots with the given analyst-labeled 
category that were classified into each possible 
category, to the total number of dots classified into 
each category. 

3. To provide a dot classification performance summary for 
each dot on the analyst's specified file (DOTUNT). 

The functions of the DISPLAY processor are such that the analyst 
may either exercise the initial processor capabilities 
(a) through (j) or the LAC IE Procedure l capabilities (k). 

The difference between the two capabilities is in the type and 
format of classification performance tables output. 


Figure 12-1 shows a function flow chart of the DISPLAY processor. 


12.1 INPUT FILES 


The only input file required for DISPLAY is the MAPTAP (sec- 
tion 4.4) file output by CLASSIFY. This file must be assigned 
to logical unit 2. 

For Procedure 1, the DISPLAY processor optionally accepts as 
input the dot data file, DOTUNT, created by the DOTDATA proc- 
essor. The file is assigned either to logical unit 19 or to 
a user-specified unit. 

12.2 OUTPUT FILES 

The DISPLAY processor will optionally generate a tape of the 
classified image on the MAPUNT unit for display on the DAS. 

The control card FORMAT allows the user to exercise this option. 
When requested, the tape should be assigned to a nine-track 
tape drive for compatibility with the DAS tape drives . The 
tape assignment must be made to logical unit 16. 

For Procedure 1, the DISPLAY processor optionally provides dot 
data classification performance summaries (instead of the 
normal output classification summaries described above) if the 
DOTUNT control card is input to the DISPLAY processor. 

12.3 SCRATCH FILES 

The disk provides random access storage for a scratch file in 
DISPLAY. No assignment is necessary. 

12.4 CARD INPUT 

All system formats referred to in this section are defined in 
section 3. 




12.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1; 
the parameter FILE is punched starting in column 11. For 
example, 

$DISPLAY FILE=N 

This card directs the monitor routine to select the DISPLAY 
processor and initiates the loading of routines used by DISPLAY. 
Parameter value N is the file number on the MAPTAP file to be 
processed; if not input, default is to file 1 of MAPTAP. 

12.4.2 SPECIAL SYSTEM FILES 

No special files are required for the DISPLAY processor. 

12.4.3 CONTROL CARDS 

Table 12-1 lists the control cards and available options for 
the DISPLAY processor. 

12.4.4 CLASS, SUBCLASS, AND FIELD DEFINITIONS 

Both test and designated fields are optional input to DISPLAY. 
However, both types of fields cannot be input in the same exe- 
cution of DISPLAY. If no test fields are input, the ground- 
truth summaries will be for training fields. When input, test 
fields must be identified with a previously defined class or 
subclass. All test class, subclass, and field definitions begin 
immediately following the *END control card and are terminated 
by the $END control card. Formats for the CLASSNAME , SUBCLASS, 
and field definition cards are defined in section 3.1.3. The 
following example shows test field input to DISPLAY . Note that 
test fields are identified with classes; that is, each NAMEl , 
NAME 2 , NAME3 , etc., must match the name of a class defined in 
either STAT or ISOCLS and used in CLASSIFY. 


$DISPLAY 
(Control cards) 
*END 

CLASSNAME NAMEl 

FIELD 1 

FIELD 2 

CLASSNAME NAME 2 

FIELD3 

FIELD4 

FIELDS 

CLASSNAME NAME 3 

FIELDS 

$END 


In the following example, test fields are identified with sub- 
classes, in which case each NAMEl, NAME 2 , NAME 3 , etc., must 
match the name of a subclass used in CLASSIFY. 

$DISPLAY 
(Control cards) 

*END 

SUBCLASS NAMEl 

TEST FIELDS FOR SUBCLASS NAMEl 

SUBCLASS NAME2 

TEST FIELDS FOR SUBCLASS NAME 2 

$END 

Note than actual name must not exceed four characters. 

Designated fields are ’ 'rge areas within the classified area 
which are either unidentifiable or can be specifically identi- 
fied as being other than the crop type of interest. This type 
of field input is meaningful only when the ITS summary report 
is being generated for one specific crop type. Pixels within 
unidentifiable areas are removed from the summaries altogether. 
Pixels within the designated "other" areas are counted as other 
regardless of how they were classified. 


An example of input designated fields follows. 
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$DISPLAY 
(Control cards) 

*END 

DESIGNATE UNIDENTIFIABLE 
(Field definitions) 

DESIGNATE OTHER 
(Field definitions' 

$END 

Either one, both, or neither of the two types of designated 
fields may be input. 

If the Procedure 1 option is to be exercised, the only kinds of 
fields that may be input are DO/DU fields. No test fields may be 
input; the training fields are the bias correction dots on the 
dot data file, DOTUNT. The format of the DO/DU field cards and 
the method of input are given in section 3.1.3. 


12.5 CARD OUTPUT 


No cards are output by the DISPLAY processor. 

12.6 RESTRICTIONS 

The system-related restrictions given in section 5 apply to 
this processor. 

12.7 DIAGNOSTIC MESSAGES 

See section 6 for further diagnostic messages. 


12.7.1 SUBROUTINE DISTCV 

Message 

OVERFLOW. 

12.7.2 SUBROUTINE DSPLAY 

Message 

** DISPLAY** - FIELDS MUST BE 

DEFINED FOR SUBCLASSES FOR 
\ 

EMPIRICAL THRESHOLDS. 

12.7.3 SUBROUTINE DSPLY1 

Message 

NOT ENOUGH STORAGE FOR CO ARI- 
ANCE MATRICES - DSPLYl. 

12.7.4 SUBROUTINE DSPLY2 

Message 

a. DISPLAY WILL ACCEPT ONLY 
1000 PTS/SCAN LINE. 


Explanation 

Error flag set by CHIN indi- 
cates overflow condition. 

Explanation 
Self-explanatory . 


Explanation 
Adjust parameters. 

Explanation 
Adjust parameters. 


/S7 


Message 


Explanation 


b. **** DSPLY2/D0TSUM - - Check if correct DOTFILE is used. 

DISCREPANCY IN DOTFILE 

INFORMATION **** 

NO. OF DOT CATEGORY LABELS 
MATCHING MAPTAP CATEGORY 
NAMES = XXXXXX.. 

NO. OF DOT CATEGORIES IS 
GIVEN AS = XXXXXX. 

c. **** DSPLAY/DSPLY2 **** Maximum number of dots exceeded. 

NO. OF DOTS = XXXXXX 

EXCEEDS THE MAX. 

ALLOWABLE (250). 

**** DOT PERFORMANCE 
SUMMARIES WILL NOT BE 
PRODUCED. 

12.7.5 SUBROUTINE EMTHRS 

Message Explanation 

ERROR BACK SPACING MAPTAP. Hardware tape-read error. 

ISTAT = XXXXX. 


12.7.6 SUBROUTINE FDIST 
Message 

FDIST-OVERFLOW CONDITION IN 
FISHIN ROUTINE FOR 
SUBCLASS = XXXX. THRESHOLD 
SET TO 999.999. 


Explanation 

The FISHIN system subroutine 
has returned an overflow con 
dition. The threshold value 
is set to 999.999 by the 
program . 


12.7.7 SUBROUTINE PRTSUM 

Message 

THE CROP NAME XXXX DOES NOT 

MATCH A CATEGORY, CLASS OR 

SUBCLASS NAME. 

THE INTENSIVE TEST SITE SUMMARY 

REPORT CANNOT BE PRINTED. 

12.7.8 SUBROUTINE REDIF3 

Message 

a. **** DSPLAY/REDIF3 

ERROR IN ' OPTION ' CARD • • • 
**** SCAN OF THIS CARD 

DISCONTINUED PROCEEDING 

TO NEXT CARD **** 

b. ***** fisher THRESHOLD 
REQUESTED - NOT PERFORMED 

•••NO. SAMPLES FOR SUB- 
CLASS NAME (=N) IS LESS 
THAN OR EQUAL TO NUMBER 
OF CHANNELS (=M). 

c. ERROR IN ACREAGE CARD - 
CARD IGNORED. 

d. ERROR ON DOTFILE CARD. 

e. *** A THRESHOLD VALUE IS 
OUTSIDE THE ALLOWABLE 
RANGE 0-1, THEREFORE 
NO THRESHOLDING HAS BEEN 
DONE IN THIS RUN *** 

XXXXX, XXXXXXXXX . XXXXX 


Explanation 

Check spelling on CROP cards. 


Explanation 

Check format and spelling of 
parameter . 


The program compares the number 
of samples to the number of 
channels. If the number of 
samples < number of channels , 
the threshold request is 
bypassed. 

Check format. 

Check format and parameters . 
Check format of THRESHOLD card. 


/S? 


Message 


Explanation 


f . * ERROR ON SUBCLASS NAME 
CARD - XXXX DOES NOT MATCH 
A SUBCLASS FROM THE MAPTAP 
FILE. 

g. * ERROR ON CLASSNAME CARD - 
XXXX DOES NOT MATCH A CLASS 
NAME FROM THE MAPTAP. 

FILE . * 

h. TEST FIELDS AND DESIGNATED 
FIELDS CANNOT BE INPUT 
TOGETHER. 

i. INVALID CONTROL CARD- 
CHECK SPELLING OF KEY WORD. 

12.7.9 SUBROUTINE SETUP 3 
Message 

a. *****DSPLAY/SETUP3 ••• 

ERROR CONDITION ON ATTEMPT 
TO POSITION MAPTAP OVER 

FILES. *****FSBSFL 

STATUS CODE = 

ABORTING RUN *** 


Self-explanatory . 


Self-explanatory . 


Self-explanatory . 


Self-explanatory. 


Explanation 

The system routine for posi- 
tioning files (FSBSFL) has 
encountered difficulties in 
positioning MAPTAP to the 
correct file. Error occurred 
in SETUP 3 routine for DISPLAY. 

User should make sure that the 
correct file number for the 
MAPTAP has been indicated and 
that MAPTAP does in fact have 
the correct number of files . 
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Message 

b. ****** DISPLAY/SETUP 3 - 
CORE OVERFLOW (TOP-TOP2) 

BY XXXXXX — EXECUTION 
TERMINATED ****** 

C. CLASSIFICATION BY CATEGORY 
(ON MAPTAP) IS REQUIRED IN 
ORDER TO PROCESS THE DOT 
DATA *** 

*** DOT PERFORMANCE SUM- 
MARIES WILL NOT BE OUTPUT 
**** 

d. *** CLASSIFICATION BY CAT- 
EGORY (ON MAPTAP) IS 
REQUIRED IN ORDER TO 
PROCESS THE DOT DATA *** 
*** DOT PERFORMANCE SUM- 
MARIES WILL NOT BE 
OUTPUT *** 

e. **** SETUP3** FROM 
DOTFIL, THE NO. OF 

DOT CATEGORIES = XXXXXX. 

**** DO t PERFORMANCE 
SUMMARIES WILL NOT BE 
PROVIDED **** 

f. *** DOT DATA PERFORMANCE 
SUMMARIES WILL NOT BE PRO- 
DUCED - THE CATEGORY NAMES 
FROM MAPTAP AND DOTFILE DO 
NOT MATCH. 


Explanation 

Subroutine SETUP 3 has computed 
the storage needed for the 
specific problem; if more is 
needed than is available, this 
diagnostic is printed. 

Procedure 1 uses category 
classifier only. 


Self-explanatory . 


Error in DOTFIL - reset DOTKEY 
and TRNKEY - turn on DOTERR. 


Change appropriate category 
names. Correct as required. 


/*/ 


Message 

CATEGORY NAMES FROM DOTFILE 


Explanation 


ARE - 

**** SETUP 3/DTCHK FROM 

DOTFILE, THE MIN. LINE 
NO. = XXXXXX. MAX. LINE 
NO. = XXXXXX. 

***** D0T PERFORMANCE 
SUMMARIES WILL NOT BE 
PRODUCED ***** 


TABLE 12-1.- DISPLAY PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


Parameter and 
default values 
(b) 


OPTION PLOT 


OPTION CHI SQUARE 
Default: 0 


OPTION FISHER 

Default: 0 


OPTION EMPIRICAL 
Default: 0 


Function 

Plots the empirical distri- 
bution functions obtained 
from the cumulative histo- 
grams of Q i for each subclass. 

Computes thresholds from the 
chi-square distribution using 
the confidence levels input 
on the THRESHOLD control 
card. 

Computes thresholds from the 
Fisher F-distribution using 
the confidence levels input 
on the THRESHOLD control 
card. 

Computes the empirical 
threshold values using the 
percentages input on the 
THRESHOLD control card. 

Uses the numbers input on the 
THRESHOLD control card for 
the actual threshold value. 


a The keyword must be left justified in card columns 1 through 10. 

3d 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 

C If the THRESHOLD control card is input, one of the four options (CHI SQUARE, 
FISHER, EMPIRICAL, or THRESHOLD VALUE) should be input also. If the OPTION 
card is omitted and the THRESHOLD card is input, chi square is assumed. If 
more than one THRESHOLD option is input, only the last one read will be 
performed. 


12*15 

3 


TABLE 12-1.— Continued 


Keyword 

Parameter and 
default values 

Function 

SYMBOLS 

S J' S 2'"' S k 
k*number of sub- 
classes on MAPTAP 
Default : 1 , 2 , • • • 9 

A,B,C,D, • • • ,Z, 

1,2, 3, 4 

Assigns symbols s i, s 2 '**’ to 
subclasses 1,2,***, 
respectively. 

OPTION 

STAT 

Default: No statistics 

printed 

Prints statistics for sub- 
classes used in the previous 
CLASSIFY run. These statis- 
tics are saved on the MAPTAP. 

OPTION 

PCT 

Default: Performance 

summary printed for 
classes only 

Prints a performance summary 
on a per-field as well as a 
per-class basis for ground- 
truth fields (i.e. , training 
or test fields within the 
classified image) . 

OPTION 

OUTLINE 

Default: Training 

fields are not 
outlined. 

Outlines training fields 
with asterisks; has no 
effect on test fields. (Test 
fields are always outlined 
with "+" symbol.) 

OPTION 

NOMAP 

Default: Map printed 

Instructs the processor not 
to print a map of the data; 
only a performance summary 
is printed. 

OPTION 

FILTER 

Default: Spatial fil- 

tering is not 
performed. 

Performs four-nearest-neighbor 
spatial filtering on the clas- 
sified image. 


TABLE 12-1.— Continued 


Keyword 

THRESHOLD 


Parameter and 

default values Function 


T . T . • • * T 
A l' 2' A k 

k*number of sub- 
classes on MAPTAP 
Default: No 

thresholding 


Uses the threshold values 

t l' t 2'*’* ^ or 3u b c l asses 
2 , • • • , respectively; thresh- 
olds must be positive 
floating-point numbers. One 
value must be specified for 
each subclass on the MAPTAP 
file. Thresholds may be 
specified also in the fol- 
lowing format: 

WVV" 

where and N 2 are integers 
which specify how many con- 
secutive times the correspond 
Xi.’.g thresholds should be used 


For the CHI SQUARE and the 
EMPIRICAL options, the num- 
bers input on these cards 
are the confidence levels 
(i.e., t^-0.99 means that 
the user wants to maintain 
99% or reject 1%) . 

The numbers input on the 
OPTION THRESHOLD VALUE card 
are the actual values to be 
used for thresholding (i.e., 
t^=10.02 means that the 
threshold value for sub- 
class 1 is 10.02). 


Keyword 


TABLE 12-1.— Continued. 


Function 


CROP 


ACREAGE 


SITE 


ANALYST 


Parameter and 
default values 

NAME 

Default : No ITS 

report 


TOTAL=X , 
CROP=Y / 
OTHER* Z 


Any 24 characters 
Default: Blanks 

Any IS characters 
Default: Blanks 


Any 60 characters 
Default: Blanks 


Initiates the option for 
printing the ITS summary 
report for the crop indi- 
cated. NAME must match a 
category, class, or subclass 
name used in CLASSIFY. 

The total acreage in the ITS 
is X; acreage of the crop 
named on the CROP control 
card is Y; and the acreage 
of all other crop types in 
the ITS is Z. X, Y, and Z 
are floating-point numbers. 
This input is meaningful 
only if the CROP control card 
is input. 

Name of the ITS; used in 
printing the heading for the 
ITS summary report . 

Name of the data analyst 
printed in the heading for 
the ITS summary report. 

Procedure used in classifica- 
tion of the ITS ; printed in 
the heading for the ITS 
summary report. 


PROCEDURE 


TABLE 12-1.— Concluded. 


Keyword 

FORMAT 


DOTFILE 


HED1 


HED2 


Parameter and 
default values 

NAME 

Default: No output 

classification map 
tape is generated 
by DISPLAY. 


INPUT/UNIT=n ,FILE=m 
Default: 

UNIT= 19 , FILE=1. 

No dot data classi- 
fication performance 
summaries . 


Any 60 characters be- 
ginning in column 11 
Default : LYNDON B . 

JOHNSON SPACE CJNTER 

Any 60 characters be- 
ginning in column 11 
Default: HOUSTON, 

TEXAS 


Function 

If NAME=UNIVERSAL , the out- 
put classification tape 
(MAPUNT) will be generated 
in the Universal format. If 
NAME=LARSYS, the MAPUNT tape 
will be generated in the 
LARSYS III format. 

Initiates the input of the dot 
data file, DOTUNT, from the 
designated (or default, if not 
designated) unit and/or file, 
and initiates the output of 
the dot data classification 
performance summaries . 

The parameter, m, designates 
the file to be processed by 
DISPLAY, and n designates the 
Fortran unit number assigned 
to the input file, DOTUNT. 

Replaces first header line 
with the indicated characters 
in the parameter field. 


Replaces second header line 
with the indicated characters 
in the parameter field. 


/ 


TABLE 12-1.— Concluded. 


Parameter and 

Keyword default values 

DATE Any 12 characters be- 

ginning in column 11 
Default: Current 

date 


Function 

Prints the indicated 12 char- 
acters in the right corner of 
the heading in place of the 
current date. 


COMMENT 


*END 

$END 


Any 60 characters be- Prints a comment line using 
ginning in column 11 the 60 characters found in 
Default: No comments the parameter field, 

printed 


Blank 


Signals the end of the con- 
trol cards. 


Blank Signals the end of all card 

input for the processing 
function . 


i 
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Concluded. 



13. DATA-TRANSFORMATION PROCESSOR - DATA-TR 


13.1 PROCEDURES 

The DATA-TR processor transforms images from the MSS data tape 
(DATAPE) . The linear transformation is performed on user-defined 
fields according to the following formula: 

z = BJT + b (13-1) 

where 
k < 16 
n < 30 

B = a k-by-n input transformation matrix (see section 3. 1.4. 2) 
x = an n-by-1 data vector 

b = a k-by-1 bias vector (see BIAS control card, table 13-1) 
z = a k-by-1 transformed data vector 

The user has the option of rescaling the transformed data via the 
RESCALE control card. (Control cards are listed in table 13-1.) 
If no rescaling is performed, it is assumed that the data can be 
represented in eight bits (0 < z < 2b5) . For rescaling the data 
to be within the range of 0 to 255, the following equation is 
computed . 

= * l MIN i " z il (13-2) 

where 

MIN^ = minimum value for component i 

= transformed data point for channel i 
R^ = range of component i [MAX^ - MIN^] 

MAX^ = maximum value for component i 

= rescaled transformed data point for channel i 


The user may obtain the parameters and MIN^ in one of three 
ways: the histogram method, the statistical method, or user 

input. The method and control cards associated with each 
method are defined below. 

13.1.1 HISTOGRAM (DEFAULT) METHOD 

A histogram of a segment of the transformed image is performed 
to find the R^ and MIN^ for each component of the transformed 
data. If the user-defined field is smaller than 2000 pixels, 
all pixels are used in the histogram; otherwise, the following 
formula is used to determine the line and sample increments 
needed to obtain 2000 points for the histogram. 



where 

M = number of samples 
N = number of lines 
a = increment (integer) 

In deriving an approximate range for the transformed data, the 
user may specify a percentage of points to be excluded from 
the upper and lower tails of the his tor ram by using the PEROUT 
control card. If not so specified, 2.5 percent of the points 
on the tails are excluded when determining the MAX^ and MIN^ 
values of the central 95 percent of the transformed data 
distribution. 

Optionally, the user may specify the maximum expected data value 
for each channel n of the input data vector x. Otherwise, the 
maximum data value for each channel is set equal to 255. 


13.1.2 STATISTICAL METHOD 


Activated by the RESCALE and MODULE or STATFILE control card, the 
statistical method is applied for deriving an approximate MAX^^ 
and MIN^ value for each component i. Using the subclass statis- 
tics, an approximate is computed using equations (13-4) and 
(13-5) . 

Let a. = MAX.(eJ + k a 3 .) (13-4) 

and S i = MINj(g? - (13-5) 

where 

i = l,***,m components of Z 

j = l,»»*,w subclasses 

k = an integer specified by the user (see LAM control card) 

= transformed mean of the ith component of subclass j 

= standard deviation of the ith component of subclass j com- 
puted from the transformed covariance matrix for subclass j . 

The approximate range of each component will be 

R^ = ct^ - <5^ ; i = !,•••, m (13-6) 

and MIN i = 6 i (13-7) 

255 

Letting the scale factor e. = -= — , i = l,*»»,m, the complete 

x R i 

transformation, including rescaling, to be performed on the 
original image (each pixel) is : 

Y i = £ i * l 6 i " + b i> I (13-8) 

where 

b^ = ith element of the bias vector b 

?!•/ 


Optional control cards that may be used in conjunction with the 
statistical method are SUBCLA, LAM, PEROUT, OPTION ORIG, and 
OPTION TRANSF. Their functions, as well as those of other con- 
trol cards, are described in table 13-1. 

13.1.3 USER INPUT METHOD 

The user may input his own scaling parameters via the OPTION 
SCAFAC control card or use input from a previous execution of 
DATA-TR in which the computed scaling parameters (255/R^ ,MIN^) 
were punched on cards via the OPTION PUNCH control card. When 
inputting the previously executed deck, the user should note 
that the values punched were computed using the MIN^ and MAX^ 
determined after a certain percentage of the tails of the histo- 
grams (see PEROUT control card) were discarded. 

The transformed/rescaled data are output in either the Universal 
or LARSYS III format. The otpion is controlled by the FORMAT 
controx card. 

A line printer ~>lot of the histogram (frequency distribution) of 
the transformed rescaled data is printed. If applicable, the MAX 
and MIN^ are printed. 

13.2 INPUT FILES 

An MSS data tape (DATAPE) must be input to the DATA-TR proc- 
essor. The tape assignment defaults to logical unit 11; 
however, by input of the DATAFILE control card, the user may 
assign any available logical unit. (See table 5-1 for file 
assignments and section 3.2, MSS Image Data Tapes, for further 
information on format . ) 


13.3 OUTPUT FILES 


The transformed/rescaled data are output on the TRFORM file, 
logical unit 14, in either the Universal or LARSYS III format. 

13.4 SCRATCH FILES 

The DATA-TR processor does not use scratch files. 

13.5 CARD INPUT 

All system card file input formats referred to in this section 
are defined in section 3. 

13.5.1 PROCESSOR CARD 

The keyword for the processor card is left justified beginning 
in column 1. For example, 

$DATA-TR 

This keyword directs the system monitor routine to select the 
DATA-TR processor and initiates loading of routines used by 
DATA-TR. 


13.5.2 SPECIAL SYSTEM FILES 

The B-matrix file discussed in section 3. 1.4. 2 must be input to 
this processor. The deck may be obtained from a previous exe- 
cution of SELECT. The module STAT file (section 3. 1.4.1) is 
optional input. If input, the second method for rescaling the 
input data (given in section 13.1.2) will be used. 

13.5.3 CONTROL CARDS 

Table 13-1 lists the control cards and available options for the 
DATA-TR processor. 
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13.5.4 FIELD DEFINITIONS 


See section 3.1.3 for the format of field definition cards. At 
least one field definition card must immediately follow the *END* 
control card. An output file is created for each field defini- 
tion input and is written on unit 14 . Each of these fields con- 
sists of a rectangular field which surrounds the vertices of 
the input field. All pixels outside the input field and within 
the rectangular output field are set equal to zero. The line 
and sample number will be numbered sequentially from 1. 


13.6 CARD OUTPUT 


The DATA-TR processor, via the OPTION PUNCH control card, out- 
puts the computed scaling parameters on cards. Two pairs of 
scaling parameters are punched on each card; i.e., each punched 
card contains the scaling parameters for two components of the 
transformed data. The cards must be used in the same order as 
punched. Their formats and definitions are as follows The 
number of cards is determined by the number of components . 


Columns Type/format 


Definition 


1-6 

11-17 

18-27 


A6 

A7 


Ai ,F9.3,F9.3, Al 


OPTION 

SCAFAC= 

(CON 1 ,MIN 1 ) 

where CON^=255/R^, R^ is the range of 
component 1 , and MIN^=minimum value for 
component 1. Parentheses must be input. 


28-37 Al ,F9 . 3 ,F9 . 3 , Al (CON 2 ,MIN 2 ) 

where CON 2 =255/R 2 , R 2 is the range of 
component 2 and MIN 2 =minimum value for 
component 2. Additional pairs are 
continued on succeeding cards. 



13 . 7 RESTRICTIONS 


The system-related restrictions in section 19 apply to this 
processor. 

The maximum number of channels allowed is 30, and the maximum 
number of components in the transformed vector is 16. 


13.8 DIAGNOSTIC MESSAGES 

13.8.1 SUBROUTINE LNTRAN 

Message 

a. *** THE NUMBER OF COMPO- 
NENTS IN Y-VECTOR TIMES 
THE NUMBER OF SAMPLES 
EXCEEDS THE SIZE OF 
STORAGE AREA - TERMINATE ** 

b. *** NUMBER OF CHANNELS 
TIMES NUMBER OF SAMPLES 
EXCEEDS 10 600.*** 

C. ***** DATATR/LNTRAN ***** 
ERROR ON INPUT FIELD DEFI- 
NITION CARD, FOR FIELD 
NAME XXXX. 

***** CONTINUING TO NEXT 
FIELD DEFINITION CARD(S). 

13.8.2 SUBROUTINE SETREM 

Message 

SETREM ERROR - THERE WERE XX 
SCALE FACTORS AND MINIMUM 
VALUES INPUT THROUGH THE 
SCAFAC OPTION. YY LINEAR 
COMBINATIONS WERE REQUESTED. 


Explanation 
Self-explanatory . 


Storage exceeded. 

Adjust parameters. 

Check format and parameters. 


Explanation 

This message indicates that the 
input scaling parameter pairs 
are not in one-to-one corre- 
spondence with the number of 
components of the transformed 


t 


Message 


Explanation 


THESE Zi'.TxT BE A SCALE FACTOR 

AND A MINIMUM VALUE FOR EACH 

l::near combination, the pro- 
gram WILL TERMINATE THROUGH 

CMERR. 

13.8.3 SUBROUTINE SETUP 8 
Message 

a. *** BAD CONTROL CARD - 
DATATR/ SETUP 8 *** 

b. ***** DATATR/SETUP 8 «•**** 
ERROR ON INPUT DATAFILE 

CARD CONTINUING TO 

PROCESS INPUT ***** 

c. ***** DATATR/SETUP 8 ***** 
ERROR ON INPUT STATFILE 

CARD CONTINUING TO 

PROCESS INPUT ***** 

d. *** INVALID CONTROL CARD 
REJECTED BY DATATR/ 

SETUP 8 *** 


data. Too many or too few 
pairs were input. 


Explanation 

Check spelling of keyword. 

Check control card, correct, 
and resubmit. 


Check control card, correct, 
and resubmit. 


Check spelling of parameter. 
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TABLE 13-1.- DATA-TR PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


Parameter and 
default values 
(b) 


B-MATRIX CARDS or FILE 
Default: None 


Function 

CARDS indicates that the 
B-matrix is on cards immedi- 
ately following. FILE indi- 
cates that the B-matrix is on 
file and initiates input of 
the BMFILE . 


FORMAT OUTP r JT=UNIVERSAL 

Default: LARSYS III 

FORMAT OUTPUT=LARSYS 

Default: LARSYS III 

RESCALE Blanks 

Default: No rescaling 


The transformed data will be 
output in Universal format. 

The transformed data will be 
output in LARSYS III format. 

Initiates rescaling of the 
transformed data to the range 
of 0 to 255. 


DATAFILE UNIT=N,FILE-M 

Default: 11=11, M=1 


STATFILE UNIT=N,FILE=M 

Default: N=20,M=1 


11 is the Fortran default logi- 
cal unit number to which the 
MSS data tape (DATAPE) has been 
assigned? M is the file number 
on the tape to be written. 

20 is the default Fortran logi- 
cal unit number to which the 
SAVTAP file has been assigned; 

M is (1) the file number on 
the tape to be processed or 
(2) , if a module STAT file is 
input, the number of the file 


a The keyword must be left justified in card columns 1 through 10. 

The parameter and default values are in card columns 11 through 72 
in any column past 10) . 



(beginning 


Keyword 


OPTION 


OPTION 


MODULE 


MODULE 


SUBCLASS 


TABLE 13-1.— Continued. 


Parameter and 

default values Function 

on which to store the training 
statistics. If M^l, this con- 
trol card must precede the 
module STAT deck. 


PUNCH 

Default: No cards 

punched 

SCAFAC=(CON 1 ,MIN 1 ) , 
(CON 2f MIN 2 ) 

(CON^ ,MIN^) 

Default: Histogram 

method of rescaling 


FILE 

Default: If RESCALE 

is input, the histo- 
gram method is 
assumed. 

CARDS 

Default: If RESCALE 

is input, the histo- 
gram method is 
assumed. 


Directs the program to punch 
the scaling parameters 
(CON^MIN^) on cards. 

CON and MIN are floating 
point values separated by a 
comma. Blanks between the 
two values are ignored. The 
scaling parameters should be 
ordered according to the 
transformed data vector 
components . 

Initiates reading of the 
SAVTAP file; if rescaling is 
performed, it initiates the 
statistical method. 


Initiates reading of the 
module STAT file that must 
immediately follow this card; 
if rescaling is performed, 
it initiates the statistical 
method. 


S 1' S 2' S 3' * * ’ ,S k 
k<number of sub- 
classes on SAVTAP<60 


Integers which define a sub- 
set of subclasses S^S^Sj,*** 
from the input statistics 


J^° 


TABLE 13-1.— Continued 


Keyword 


LAM 


PEROUT 


MAXPT 


Parameter and 
default values 

Default: Statistics 

for all subclasses 
defined are used in 
calculating the 
scaling factors. 

N 

Default: N=2 


N 

Default: N=5, in 

which case 5% of the 
total distribution 
will be deleted from 
both the upper and 
lower tails of the 
transformed data set. 


M l' M 2' M 3'*-*' M k 

k<30 

Default: 255,255, ••• 


Function 

(SAVTAP) to be used in cal- 
culating the scaling factors 
and approximating . 


An integer multiplied by the 
standard deviations of the 
input subclass statistics to 
derive an approximate range 
for rescaling the transformed 
data . 

An integer which specifies the 
percentage of points to be 
deleted from the upper and 
lower tails of the trans- 
formed data distribution in 
computing an approximate 
range for rescaling. For the 
histogram method of rescaling, 
N/2% is deleted from both the 
upper and lower tails of the 
histogram. For the statisti- 
cal method of rescaling, N% 
is deleted from both the upper 
and lower tails of the 
histogram. 

Maximum expected value of MSS 
data tape (DATAPE) input for 
each channel. M's are integer 
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TABLE 13-1.— Continued 


Keyword 


OPTION 


OPTION 


BIAS 


HEDl 


Parameter and 
default values 


ORIG 

Default: No sta- 

tistics printout 


TRANSF 

Default: No sta- 

tistics printout 


Function 

used in deriving an approximate 
range (MIN^MAX.) of the 
transformed data set for the 
histogram method of rescaling. 

Initiates the printout of the 
original (untransformed) sta- 
tistics for the subclasses 
input for the statistical 
rescaling method. 

Initiates the printout of the 
transformed statistics. 


b l' b 2' b 3''’*' b k 

or 

N * b l' b N+l'"’' b V 
k=number of coi ^onents 

in the transformed 

data set and N=an 

integer repetition 

factor for 

i<16 

Default: b^=0.0 

Any 60 characters 
beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 


All b's are decimal (floating 
point) numbers , separated by 
commas ; they comprise the bias 
vector to be applied in the 
transformation of the input 
data set: Z=B (x+b) 


Replaces first header line 
with the indicated characters 
in the parameter field. 
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TABLE 13-1.- 
Parameter and 

Keyword default values 

HED2 Any 60 characters 

beginning in column 11 
Default: HOUSTON, 

TEXAS 

DATE Any 12 characters 

beginning in column 11 
Default: Current date 

COMMENT Any 60 characters 

beginning in column 11 
Default: No comments 

printed 

*END Blank 

$END Blank 


Concluded. 

Function 

Replaces second header line 
with the indicated characters 
in the parameter field. 

Prints the indicated 12 charac- 
ters in the right corner of 
the heading in place of the 
current date. 

Prints a comment line using 
the 60 characters found in 
the parameter field. 

Signals the end of the control 
cards . 

Signals the end of all card 
input for the processing 
function. 
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Figure 13-1.— Functional flow chart for the DATA-TR processor. 










Figure 13-1.— Concluded. 
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14. STATISTICS TRANSFORMATION PROCESSOR - TRSTAT 


The TRSTAT processor will read a SAVTAP file or card deck gener- 
ated by STAT or ISOCLS, perform a linear transformation on the 
means and covariances, and output the transformed statistics on 
a new file. The equation for the linear transformation of the 
means is as follows: 

V' - A y + b (14-1) 

where 

A = a k-by-n matrix (see section 14.4.2); k < 15 and n £ 30 
y = an n-by-1 mean vector 

b ■= a k-by-1 bias vector (see card type 4, section 14.4.2) 
y ' = a k-by-1 transformed mean vector 

The equation for the linear transformation of the covariances 
is as follows: 


K' = AKA T (14-2) 

where 

K = an n-by-n covariance vector 
T 

A = an n-by-k transpose of A 

K' = a k-by-k transformed covariance matrix 

14 . 1 INPUT FILES 

A set of statistics must be input either from the SAVTAP file 
or by cards. (See STATFILE or MODULE control card, table 14-1.) 

14 . 2 OUTPUT FILES 

The transformed statistics are output on a file in the SAVTAP 
format. (See section 4.1 and STATFILE control card.) 
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14 . 3 SCRATCH FILES 


The TRSTAT processor uses no scratch files. 

14.4 CARD INPUT 

All system formats referred to in this section are defined in 
sections 3 and 14.4.2. 


14.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 
For example, 

$ TRSTAT 

This card directs the monitor routine to execute the TRSTAT 
processor and initiates loading of routines used by TRSTAT. 


14.4.2 A-MATRIX FILE 

The A-matrix file is composed of a transformation matrix and an 
additive bias vector. Its format is shown below. For additional 
information on the transformation matrix, see section 3. 1.4. 2. 


Card type Columns 

Type/format 

1 1-8 

Alphanumeric 

2 6-7 

Integer/12 

13-14 

Integer/I2 

17-80 

Integer/12 

3 6-20 

Real/E 15. 8 

21-35 

• 

• 

Real/El 5. 8 

• 

66-80 

Real/El5 . 8 


Definition 

Keyword A-MATRIX. 

Number of linear combinations. 
Number of channels. 

Actual channels used in 
computation . a 

Element 1 
Element 2 

Element 5 of A-matrix — 
the full matrix is punched, 


a The channels in the B-matrix described in section 3. 1.4. 2 begin in column 18. 
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Definition 

5 values per card. Addi- 
tional elements are contin- 
ued on succeeding cards. 

Element l a 
Element 2 

Element N of b- vector, 

5 values per card, with 
additional values continued 
on succeeding cards? N=number 
of linear combinations. 

14.4.3 CONTROL CARDS 

Table 14-1 lists the control cards and available options for 
the TRSTAT processor. 

14.4.4 FIELD DEFINITIONS 

No field definition cards are input to TRSTAT. 

14.5 CARD OUTPUT 

The transformed statistics deck will be output in the same format 
as the module STAT card file. 

14.6 RESTRICTIONS 

The system-related restrictions in section 19 apply to this 
processor. 


Card type Columns Type/format 


6—20 Real/E15. 8 

21-35 Rea 1/El 5. 8 


66-80 Real/El 5. 8 


a 

Unlike the B-matrix described in section 3. 1.4. 2, the A-matrix file contains 
the additive vector. 


140 
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The maximum dimension of the A-MATRIX is IB by 30, and the 
maximum number of elements in the additive B-vector is 30. 


14 • 7 diagnostic MESSAGES 

14.7.1 SUBROUTINE SETUP 9 

Message 

a. *** BAD SUPERVISOR CONTROL 
CARD SETUP9*** 

b. ERROR ON STATFILE CARD. 

C. NUMBER OF CHANNELS FROM 
STAT FILE DOES NOT EQUAL 
THE NUMBER OF CHANNELS ON 
A-MATRIX FILE. CHANNELS 

ON STAT FILE = . 

CHANNELS ON 

A-MATRIX = . 

d. INVALID CONTROL CARD 
REJECTED 
*** SETUP 9 *** 

14.7.2 SUBROUTINE TRAMTX 

Message 

ERROR IN TRYING TO POSITION 
TRANSFORMED STAT FILE TO 
BEGINNING OF FILE XXX. 


Explanation 

Invalid control card. Check 
spelling of keyvord. 

Check spe] ling of keyword 
and parameters . 

Self-explanatory . 


The parameter field of the 
control card is in error . 


Explanation 
Check file assignment. 



TABLE 14-1.- TRSTAT PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 


Parameter and 
default values 
(b) 


A-MATRIX Blank 

Default: File must 

be input 


Function 

Initiates input of the A-matrix 
and b '-vector. The A-matrix card 
file (see section 14.4.2 for 
format) immediately follows 
this card. 


OPTION P,0,T 


MODULE Blank 


STATFILE INPUT/UNIT=N,FILE=M, 
OUTPUT/UNIT=L , FILERS 
Default: N«l; M=l; 

L=14 ; S=1 


P punches the transformed sta- 
tistics; 0 prints the original 
statistics; and T prints the 
transformed statistics . 

Initiates input of the module 
STAT card file, which immedi- 
ately follows this card. 

(See section 3. 1.4.1 for 
module STAT file format.) 

N is the Fortran logical unit 
number to which the tape con- 
taining the statistics to be 
transformed has been assigned; 
M is the number of the file to 
be processed; L is the Fortran 
logical unit number to which 
the transformed statistics are 
to be output; and S is the 
number of the next file to be 
created on output SAVTAP file. 


a The keyword must be left justified in card columns 1 through 10. 

^The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 




Keyword 


Function 


CHANNELS 


SUBCLASS 


HED1 


HED2 


DATE 


COMMENT 


TABLE 14-1.— Continued. 


Parameter and 
default values 

8 1 ,N 2 ,N 3 ,... / N k 
k=number of matrix 
channels<30 
Default: None 


N's are integer channel numbers 
referring to the SAVTAP file. 

The number of channels requested 
rrom SAVTAP must be equal to the 
number of channels on the 
A-matrix file. 


S^,S 2 /S 3 , • • • ,S^ Transforms statistics for only 

k<number of subclasses subclasses S^ ,S 3 ,S 3 , • • • . 

on SAVTAP<60 

Default: Statistics 

for all subclasses 

defined 


Any 60 characters Replaces first header line 

beginning in column 11 with the indicated characters 
Default: LYNDON B. in the parameter field. 

JOHNSON SPACE CENTER 


Any 60 characters 
beginning in column 11 
Default: HOUSTON, 

TEXAS 

Any 12 characters 
beginning in column 11 
Default: Current date 


Replaces second header line 
with the indicated characters 
in the parameter field. 


Prints the indicated 12 charac- 
ters in the right corner of the 
heading in place of the cur- 
rent date. 


Any 61 characters Prints a comment line using 

beginning in column 11 the 60 characters found in 
Default: No comments the parameter field, 

printed 


TABLE 14-1.— Concluded 


Keyword 
*END 

$END 


Parameter and 
default values 

Blank 


Blank 


Function 

Signals the end of the control 
cards. 

Signals the end of all card 
input for the processing 
function. 
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TRSTAT 


INPUT CONTROL CARDS 


INPUT A-MATRIX FILE 


INPUT STAT FILE 



OUTPUT TRANSFORMED 
STATISTICS FILE 


Figure 14-1.— Functional flow chart for the TRSTAT processor 



15. DIMENSIONAL HISTOGRAM PROCESSOR - NDHIST 


The NDHIST processor computes an n -.mensionai histogram of areas 
of the MSS data tape (DATAPE) of which the user has requested 
scatter plots. The pixel dimensions are user specified by the 
plotting channels, and the histogrammed pixels are output on the 
NHSTUN file. The file is written as an interface to the SCTPRL 
processor. 

15.1 PROCEDURES 

The number of channels (dimensions) used in histogramming is 
specified by means of the CHANNELS control card defined in 
table 15-1. The plotting channels are the primary input channels. 
The color channels are for further delineation of the frequency 
determination . 

In the case of n > 2, the dimensionality is reduced to 2 in the 
SCTRPL processor by means of a linear transformation. 

Composed of the plotting channels, each unique (positional) pixel 
within the field designates the position of a pixel on the scatter 
plot tape, SCTRUN . The frequency of each positional pixel is 
determined as a function of the color channels (if input) and the 
plotting channels. 

If only plotting channels are input, the positional pixel and 
its frequency are calculated using plotting channel data. If 
both plotting and color channels are input, the frequency is 
a function of both sets of channels. 

The color assignment for each plot pixel may be optionally set 
by the NDHIST or the SCTRPL processor. If applicable, the color 




codes are output on the NHSTUN file. The color codes may be 
set using the following information. 

a. The original radiance value of the pixel (see CHANNELS con- 
trol card, table 15-1) . 

b. The mean value of the cluster or subclass to which the pixel 
was assigned during clustering or classification. In exer- 
cising this option, the user must input a classification or 
cluster map (see MAPUNT control card, table 15-1) to this 
processor. To execute the SCTRPL processor, a SAVTAP file 
related to the MAPUNT must be input (see CHANNELS and STATFILE 
control cards, section 16, table 16-1) . The subclass or 
cluster numbers assigned to the pixel during classification 

or clustering are stored on the NHSTUN file, passed to the 
SCTRPL processor, and used for retrieving the means from the 
SAVTAP file. 

c. The mean value of the test or training field from which the 
pixel was extracted. 

d. User-defined colors (see COLOR control card, table 16-1) . 

e. From any pass on the MSS data tape when using multiregistered 
Landsat data (see CHANNELS control card, table 15-1) . 

The areas selected for histograraming are defined by test and/or 
training fields. The manner in which the fields are collected 
or grouped for histogramming is user controlled by input param- 
eters. The data vectors may be histogrammed collectively on a 
class, subclass, or per-field level. The maximum number of 
fields input on any level is 200, and the maximum number of 
unique data vectors accumulated on any level is 12 000 divided by 
one-fourth the number of plotting channels. 

A functional flow diagram of the NDHIST processor is given in 
figure 15-1. 



15.2 INPUT FILES 


An MSS data tape (DATAPE) must be input to the NDHIST processor. 
The tape assignment defaults to logical unit 11; however, by- 
input of the DATAFILE control card, the user may assign any 
available logical unit. (See table 5-1 for file assignments 
and section 3.2, MSS Image Data Tapes, for further information.) 
Optionally, a classification or cluster MAPUNT tape may be 
input (see MAPUNT control card) . 

15.3 OUTPUT FILES 

A multifile tape is always output: NHSTUN file. It is an inter- 

face to the SCTRPL processor and must be assigned to tape or 
Fastrand. No file-skipping capability is available; the first 
file created is always file 1. (See HISFIL control card, 
table 15-1, and appendix I.) 

15.4 SCRATCH FILES 

The NDHIST processor dynamically assigns random access drum 
storage for the histogram counters, color codes, identification 
information, and (optionally) the pixel assignment from the 
classified or clustered image tape (MAPUNT) . 

15 . 5 CARD INPUT 
15.5.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 

For example, 

$NDHIST 

This card directs the system monitor routine to select the NDHIST 
processor and initiates loading of all the NDHIST routines into 
the system. 


/?7 


15.5.2 SYSTEM CARD FILES 


No special system card decks are required for the NDHIST 
processor. 


15.5.3 CONTROL CARDS 

Table 15-1 lists the control cards and available options for 
the NDHIST processor. 


15.5.4 FIELD DEFINITIONS 

The field cards, which immediately follow the *END control card, 
define the areas to be histogrammed, and the OPTION control card 
determines the level of histogramming. The fields may be ordered 
in one of four ways: 

a. As input to STAT (section 8.4.4) 

b. As input to ISOCLS (section 9.5.4) 

c. As input to CLASSIFY (section 11.5.4) 

d. As a user-defined field (section 3.1.3) 

For example: 

*END 

CL? SS NAME WHT 

SUBCLASS WHT1 

(Field card 1) 

(Field card 2) 

SUBCLASS WHT 2 

(Field card 3) 

CLASS NAME NWHT 

SUBCLASS NWH1 

(Field card 4) 

SUBCLASS NWH2 

(Field card 5) 

SUBCLASS NWH3 

(Field card 6) 

(Field card 7) 

$END 



If the histogram is accumulated on class bases, fields 1, 2, and 
3 are histogrammed collectively and output as data file 1; and 
fields 4, 5, 6, and 7 are histogrammed collectively and output 
as data file 2. 

If the histogram is accumulated on subclass bases, fields 1 and 
2 are histogrammed collectively and output as data file 1; field 3 
is histogrammed and output as data file 2; field 4 is histogrammed 
and output as data file 3; field 5 is histogrammed and output as 
data file 4; and fields 6 and 7 are histogrammed collectively and 
output as data file 5. 

If the histogram is performed on per-field bases, each field is 
histogrammed separately and output to a file, making a total of 
seven data files created. 

On a cumulative histogram, a maximum of 200 fields may be input. 
See section 3.1.3 for format of the field definition card. 

15.6 CARD OUTPUT 

The NDHIST processor does not provide punched card output. 

15.7 RESTRICTIONS 

The system-related restrictions in section 19 apply to this 
processor. Other restrictions are as follows. 

• A maximum of 16 channels may be histogrammed. 

• A maximum of 4 channels may be used for color codes. 

• The maximum of unique vectors to be histogrammed is 

< 12 000 

- 1/4 (number of channels) 
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(15-1) 


• A maximum of 4000 words of storage is allowed for storing the 
MSS data. The equation for computing maximum number of 
pixels is 

< 4000 

n - (number of channels (number of samples per scan line) ] 

(15-2) 


IS. 8 DIAGNOSTIC MESSAGES 

15.8.1 SUBROUTINE ADORES 

Message 

TOO MUCH DATA REQUESTED. 

REDUCE NO. OF SAMPLES PER 
SCAN LINE AND/OR NO. OF 
CHANNELS . 

15.8.2 SUBROUTINE NDHST1 

Message 

a. — VECTORS WERE NOT HISTO- 
GRAMMED, BUT USED IN 
COMPUTING FIELD MEANS, 

IF APPLICABLE. 

b. ERROR IN FIELD CARD. 
ABORTING . 

15.8.3 SUBROUTINE NDHST2 

Message 

CORE LIMITS EXCEEDED. MAXIMUM 
NO. OF VECTORS ACCEPTED IS 


Explanation 
Self-explanatory . 


Explanation 

The his tog rammed vector table 
is full. N number of unique 
vectors were not histogrammed . 

Check format and parameters . 


Explanation 
Self-explanatory . 
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15.8.4 SUBROUTINE RESTO 


Message 

ERROR READING DISK. 

15.8.5 SUBROUTINE SET 10 

Message 

a. INVALID CONTROL CARD - 
IGNORED . 

b. ERROR ON CHANNELS CARD. 

c. ERROR ON DATA FILE CARD. 

d. ERROR ON DAS FILE CARD. 

e. ERROR ON N-DIM HISTOGRAM 
FILE CARD. 

f. ERROR ON OPTION CARD. 

15.8.6 SUBROUTINE STODAT 

Message 

NOT ENOUGH DISK SPACE TO STORE 
DAS TAPE DATA. 


Explanation 

Self-explanatory. 

Explanation 

Check spelling of keyword. 

Check parameter field of 
CHANNELS control card. 

Check parameter field of 
DATAFILE control card. 

Check parameter field of DAS 
control card. 

Check parameter field of 
HISFIL control card. 

Check parameter field of 
OPTION control card. 

Explanation 
Self-explanatory . 




TABLE 15-1.- NDHIST PROCESSOR OPTIONS AND CONTROL CARDS 


[All m and n are integers . ] 

Keyword 
(a) 

CHANNELS 

the SCTRUN tape (i<16) . If 
i-2, n^ defines the sample 
location and n 2 the line 
location on the scatter plot 
tape (SCTRUN). If i>2, the 
pixels will be transformed to 
two components in the SCTRPL 
processor; component 1 will 
define the sample location 
and component 2 the line loca- 
tion on the scatter plot tape 
(SCTRUN) . The m's represent 
the channels for the color 
codes (j<4) . If the COLOR 
channels are input, the histo- 
gram is a function of both the 
PLOT and COLOR channels; if 
the COLOR channels are omitted 
the histogram is a function of 
only the PLOT channels. (See 
section 16 for further 
• information . ) 


Parameter and 
default values 
(b) 

PLOT=n lf n 2 , n 3 , • 
COLOR -m^ , m 2 ,m 3 , 
Default: None 


.■v 


Function 

The n's are the channels for 
determining the position (PLOT) 
of the pixels to be output on 


The keyword must be left justified in card columns 1 through 10. 
b 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 



TABLE 15-1 


OPTION 


MEANS 


Continued. 


Function 

N is the logical unit number 
assigned to the MSS data tape 
(DATAPE) ; M is the file number 
of the data to be processed. 

N is the logical unit number 
assigned to the MAPUNT tape; 

M is the file number of the 
data to be processed. (The 
order of the fields to be his- 
togrammed must correspond to 
the order of the clustered or 
classified fields on the input 
MAPUNT tape.) 

The logical unit number 
assigned to the NHSTUN file. 

Fields will be histogrammed 
on basis of classes. 

Fields will be histogrammed 
on basis of subclasses. 

Fields will be histogrammed 
on per-field basis. 

The means of each field will be 
computed for the COLOR channels 
on the CHANNELS card and output 
on the NHSTUN file. 


t 3 
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TABLE 15-1.- 
Parameter and 

Concluded. 

Keyword 

default values 

Function 

HED1 

Any 60 characters 

Replaces first header line 


beginning in column 11 

with the indicated characters 


Default: LYNDON B. 

JOHNSON SPACE CENTER 

in the parameter field. 

HED2 

Any 60 characters 

Replaces second header line 


beginning in column 11 

with the indicated characters 


Default: HOUSTON, 

TEXAS 

in the parameter field. 

DATE 

Any 12 characters 

Prints the indicated 12 charac 


beginning in column 11 

ters in the right corner of 


Default: Current date 

the heading in place of the 
current date. 

COMMENT 

Any 60 characters 

Prints a comment line using 


beginning in column 11 

the 60 characters found in 


Default : No comments 

printed 

the parameter field. 

*END 

Blank 

Signals the end of the control 
cards . 

$END 

Blank 

Signals the end of all ;jard 


input for the processing 
function . 
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16. SCATTER PLOT PROCESSOR - SCTRPL 


The SCTRPL processor reads the NHSTUN file written by the NDHIST 
processor, determines the location of each unique data vector on 
the scatter plot, and outputs the spectral plot in the Universal 
format. A scatter plot is created and output for each file 
stored on the NHSTUN unit. 

The location (line and sample intersection) of each pixel on the 
two-axis scatter plot will be computed using either the radiance 
values or two linear combinations of radiance values. (This 
option is controlled in section 15 by the CHANNELS control card, 
table 15-1.) 

If the data vector is to be transformed (see E-MATRIX control 
card, table 16-1) , the following equation and conditions will 
be applied: 

y = Bx + c (16-1) 

where 

y = a 2-by-l vector 
B = a 2-by-n vector, n < 16 
x = an n-by-1 vector, n < 16 
c = a 2-by-l vector 

If the transformed data are to be rescaled (see SCALE control 
card, table 16-1) , the following equation will be applied: 

/HI. - L0.\ 

j * i MIN i - z xi ' 16 - 2 > 

where 

HI^ = an input parameter for the upper rescale limit for 
channel i 




LO^ = an input parameter for the lower rescale limit for 
channel i 

MIIJ^ = minimum value for channel i 
Z^ = transformed data point for channel i 
« range (MAX^ - MIN^) for channel i 
= rescaled transformed data value for channel i 
MAX^ * maximum value for channel i 

The scatter plot is created and output line by line. All 
the pixels belonging to a line, as determined by the second coor- 
dinate of the pixel, are collected; and, in the sample location 
determined by the first coordinate, the color assignment and 
frequency of occurrence of each pixel are output as channels 1 
through n, respectively. [See procedures for NDHIST processor 
(section 15.1) for definition of color assignments.] 

The dimensions of the output file are user controlled by the 
following input control cards, which are defined in greater 
detail in table 16-1 (the control cards for the SCTRPL processor) 
and table 15-1 (the control cards for the NDHIST processor) . 

Keyword Parameters 

SIZE XLOW=0 ,XHIGH=128 ,XSIZ=129 ,YLOW=0 , 

YHIGH=64 , YSIZ=65 

CHANNELS PL0T=3 , 4 ,C0L0R=5 , 6 , 7 , 8 

The output file will contain: 

a. 129 samples per scan line with a maximum data resolution of 
128 on channel 3. 

b. 65 lines per file with a maximum data resolution of 64 on 
channel 4 . 
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c. 5 channels with channels 1 through 4 containing the color 
pixel (determined by channels 5, 6, 7 , and 8 on CHANNELS 
control card) and channel 5 the frequency. 

d. A color pixel for each positional pixel determined by 
channels 3 and 4 on the CHANNELS control card. 

If a MAPUNT file containing the subclass or cluster numbers is 
input to the NDHIST processor/ either a SAVTAP file related to 
the MAPUNT file must be input (see STATFILE control card) or 
the user must input the color codes on cards (see COLOR control 
card) . 

Optionally, a line printer pixel frequency scatter plot will be 
output (see PIXPLT control card) . The frequency of occurrence 
or log of frequency of occurrence will be represented by a symbol 
(see SYMBOL control card) . The location of the symbol on the 
plot will be determined by the radiance value of the pixel. If 
the data have been transformed, then the data must be rescaled 
to exercise this option. 

A functional flow diagram of the SCTRPL processor is given in 
figure 16-1. 

16.1 INPUT FILES 

The NHSTUN file created by NDHIST must be input. (See the HISFIL 
control card and appendix E for format. of the NHSTUN tape.) 

The SAVTAP file created by the STAT or ISOCLS processor option- 
ally may be input. (See the SlriTFILE control card and sec- 
tion 4.1 for a description of the file.) 
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16 . 2 OUTPUT FILES 


A multifile Universal-formatted tape containing the spectral 
plots and color keys, when applicable, will be output. (See the 
SCTRUN control card and appendix G for tape format.) 

16.3 SCRATCH FILES 

The program dynamically assigns random access disk storage for 
scratch files. 

16.4 CARD INPUT 

16.4.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1. 

For example, 

$SCTRPL 

This card directs the system monitor routine to select the 
SCTRPL processor and initiates loading of all the SCTRPL rou- 
tines into the system. 

16.4.2 SYSTEM CARD FILES 

The module STAT and B-matrix card files may be input. See sec- 
tion 3 for formats . 

16.4.3 CONTROL CARDS 

Table 16-1 lists the options and control cards for the SCTRPL 
processor. 

16.4.4 FIELD DEFINITIONS 

Field definitions do not apply to this processor. 
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16.5 CARD OUTPUT 


The SCTRPL processor does not provide punched card output. 

16.6 RESTRICTIONS 

The system-related restrictions in section 19 apply to this 
processor. 

If the color codes for the scatter plot tape SCTRUN are to be 
principal component (PC) colors, the user must ensure that the 
values are positive. 

The maximum dimension of the B-matrix is 2 by 16; the maximum 
number of elemer ts in additive vector b is 16 . 

The maximum number of channels on the output tape SCTRUN is 5. 
Color codes are the first n-1 channels; the frequency is the nth 
channel. 

The maximum number of channels selected from the SAVTAP file is 4. 

The maximum size of the output tape SCTRUN is 200 samples per scan 
line and 200 lines. 

16.7 DIAGNOSTIC MESSAGES 
16.7.1 SUBROUTINE LINPLT 

Message Explanatio n 

A TOTAL OF POINTS WERE Data may be rescaled to a 

NOT DISPLAYED ON THE LINE resolution of 100. 

PRINTER GRAPH. THE POINTS 
WERE OUT OF RANGE IN EITHER 
THE X DIRECTION OR Y 
DIRECTION. 


J 
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IS. 1.2 SUBROUTINE SETADR 


Message 

NOT ENOUGH DISK SPACE. 

TOTAL WORDS OF DISK 
SPACE = XXXXXXXXXXXX. 

TOTAL WORDS OF DISK 
SPACE = XXXXXXXXXXXX. 

16.7.3 SUBROUTINE SETH 
Message 

a. INVALID CONTROL CARD — 
IGNORED . 

b. ERROR ON CHANNELS CARD. 

c. ERROR ON STATFILE CARD. 

d. ERROR ON NHSTUN HISTOGRAM 
FILE CARD. 

e. ERROR ON OPTION CARD. 

f. ERROR ON OPTION CARD. 

g. ERROR ON TAPE SIZE CARD. 

h. ERROR ON SCATTER PLOT 
TAPE CARD. 

i. ERROR ON B-MATRIX CARD. 

j. ERROR ON SCALING CARD. 


Explanation 
Adjust parameters. 


Explanati on 

Check spelling of keyword. 

Check parameter field of 
CHANNELS card. 

Check parameter field of 
STATFILE card. 

Check parameter field of HISFIL 
card. 

Check parameter field of 
OPTION card. 

Check format and parameter. 

Check parameter field of 
SIZE card. 

Check parameter field of 
SCTRUN card. 

Check spelling and parameter. 

Check parameter field of 
SCALE card. 
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k. DATA MUST BE RESCALED 
BEFORE PIXEL FREQUENCE Y 
PLOT OPTION MAY BE 
SELECTED. 

l. NO. OF PLOTTING CHANNELS 

NO. OF B-MATRIX CHANNELS 
MUST BE EQUAL. CHANNELS 
ARE , RESPECTIVELY. 

Message 

m. ERROR IN POSITIONING 
NHSTUN FILE TO FILE 


16.7.4 SUBROUTINE SORTVC 

Message 

ERROR IN SORTING VECTORS. 

16.7.5 SUBROUTINE VECSCN 

Message 

ERROR OCCURRED SCANNING VECTOR 
CARD. 


Transformed data must be 
rescaled for line printer 
plot. 

Number of channels to be trans- 
formed must equal the number 
of channels in transformed 
matrix. 

Explanation 

Physical tape error occurred. 
Resubmit run. 

Explanation 
Self-exp lanatory . 

Explanation 

Check keypunching error on 
cards . 
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TABLE 16-1.- SCTRPL PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

Parameter and 
default values 


(a) 

(b) 

Function 

CHANNELS 

n lf n 2 ,n 3 , • • • / n i 
i<number of channels 
on SAVTAP<30 
Default: First 

4 channels from NDIM 
file 

Statistics for these channels 
will be extracted from the 
SAVTAP file; n^ must be a sub- 
set of channels on the SAVTAP 
file. 

STATFILE 

UNIT«N,FILE=M 
Default: None 

N is the logical unit number 
assigned to the SAVTAP file; 
M is the number of the file 
to be processed. 

HISFIL 

UNIT=N 

Default: N=13 

N is the logical unit number 
assigned to the NHSTUN file. 

PIXPLT 

LOG 

Line printer pixel scatter 
plot of the log of frequency 
of occurrence will be printed. 

PIXPLT 

FREQ 

Line printer pixel scatter 
plot of the frequency of 
occurrence will be printed. 

PIXPLT 

RESCALE 
Default: No 

rescaling. XSIZ=101, 
YSIZ=101; the range 
for x-axis is XL0W+ 
XSIZ-1; the range for 
y-axis is YLOW+YSIZ-1. 

The frequency of occurrence 
of the pixel for the line 
printer scatter plot will be 
rescaled to ranges XLOW, XHIGH 
YLOW, and YHIGH. XSIZ will 
determine the number of bins 
on the x-axis; YSIZ, the 


a The keyword must be left justified in card columns 1 through 10. 

l- i 

The parameter and default values are in card columns 11 through 72 (beginning 
in any column past 10) . 


TABLE 16-1.— Continued. 


Parameter and 


Keyword 

default values 

Function 




number of bins on the y-axis . 
(See SIZE control cards.) 

COLOR 

(m 1 ) , (m 2 ) , • • • , (m p ) 
or 

m^=n^,n 2 / • • • ,n^ is the color 
assignment for cluster 1; 


L* (m 1 ) , 

K* (n» L+ ]_) 

m 2 =n l' n 2 ' * * * ,n l co ^ or 


L and K 

are integer 

assignment for cluster 2; 


repetition factors . 

nip=n^,n 2 , • • • , n^ is the color 


Default: 

No user 

assignment for cluster n. 


input of 

color codes 

p<60 and £<4; 0<n^<255. 

SIZE 

XSIZ=N 


The number of samples per line 


Default: 

XSIZ=101 

to output on the scatter plot 
tape; N<2Q0. 

SIZE 

YSIZ=N 


The number of lines to output 


Default : 

YSIZ=101 

on the scatter plot tape; 
N<200 . 

SIZE 

XHIGH=N 


The upper limit of the radi- 


Default: 

XHIGH=100 

ance value for the sample axis 
(x-axis) ; N<255. 

SIZE 

XLOW=N 


The lower limit of the radi- 


Default: 

XLOW=0 

ance value for the sample axis 
(x-axis) ; 0<N<XHIGH. 

SIZE 

YHIGH=N 


The upper limit of the radi- 


Default: 

YHIGH=100 

* 

ance value for the line axis 




(y-axis); N<255. 

SIZE 

YLOW=N 


The lower limit of the radi- 


Default: 

YLOW=0 

ance value for the line axis 




(y-axis); 0<N<255. 




TABLE 16-1.- Continued 


Keyword 

Parameter and 
default values 

Function 

PLOTAP 

UNIT=N 

Default: 

N=12 

N is the logical unit number 
assigned to the spectral plot 
tape . 

B-MATRIX 

CARDS 

Default: 

None 

The B-matrix is being input by 
cards . 

B-MATRIX 

FILE 

Default: 

None 

The B-matrix is being input by 
file. 

3VEC 

T l' T 2'-”' T n 
n=number of linear 

combinations in 

B-matrix<2 

Default: T w =0.0 

Elements of the additive 
vector to be used in the 
transformation; T is a 
floating-point: number. 

BCKGND 

N 

Default: 

N=255 

If N=0, background will be 
black; if N=255, background 
will be white. 


SCALE 

FILE 


The scale factors will be com- 
puted from the NHSTUN file. 

SCALE 

XMAX=T 

Default: 

computed 

file. c 

XMAX will be 
from the NDIM 

The upper range for the trans- 
formation of the sample values 
(x-axis) ; T is a floating- 
point number. 

SCALE 

XMIN=T 

Default: 

computed 

file. c 

XMIN will be 
from the NDIM 

The lower range for the trans- 
formation of the sample values 
(x-axis) ; T is a floating- 
point number. 


c If one of the parameters XMIN, XMAX, YMIN, or YMAX is input, all four param- 
eters must be input. 





TABLE 16-1.— Continued. 


Keyword 

SCALE 


SCALE 


SCALE 


HED1 


HED2 


DATE 


Parameter and 
default values 


YMAX=T 

Default: 


YMAX will be 


computed from the 
NHSTUN file. d 


YMIN=T 

Default: 


YMIN will be 


computed from the 
NHSTUN file. d 


RESCALE 

Default: No rescaling 

of the transformed 
data. 

Any 60 characters 
beginning in column 11 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

Any 60 characters 
beginning in column 11 
Default: HOUSTON, 

TEXAS 

Any 12 characters 
beginning in column 11 
Default: Current date 


Function 

The upper range for the trans- 
formation of the line values 
(y-axis) ; T is a floating- 
point number. 

The lower range for the trans- 
formation of the line values 
(y-axis) ; T is a floating- 
point number. 

The transformed data will be 
rescaled to the range of XLOW, 
XHIGH, YLOW , and YHIGH. (See 
SIZE control card.) 

Replaces first header line 
with the indicated characters 
in the parameter field. 

Replaces second header line 
with the indicated characters 
in the parameter field. 

Prints the indicated 12 charac- 
ters in the right corner of the 
heading in place of the current 
date. 


d If one of the parameters XMIN, XMAX, YMIN, or YMAX is input, all four param- 
eters must be input. 




TABLE 16-1.— Concluded 


Keyword 

SYMBOLS 


MODULE 

COMMENT 

★ END* 
$END* 


Parameter and 
default values 

S 1' S 2' S 3' * * * /S k 
k<32 

Default: Two sets 

of 10 symbols 
overprinted 


Blank 


Function 

Character set separated by 
commas, with a maximum of 
32 characters. The number of 
symbols/2 determines the num- 
ber of bin levels. The first 
set of symbols is overprinted 
by the second set. A blank is 
not a legitimate character. 

Initiates the input of the mod- 
ule STAT card file which immedi- 
ately follows this card (see 
section 3. 1.4.1 for format). 


Any 60 characters Prints a comment line using 

beginning in column 11 the 60 characters found in the 
Default: No comments parameter field, 

printed 


Blank 


Signals the end of the control 
cards . 


Blank Signals the end of all card 

input for the processing 
function. 


»» 

an 


\ 

\ 


Figure 16-1.— Functional flow chart for the SCTRPL processor. 
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17. DOT DATA PROCESSOR - DOTDATA 


17.1 PROCEDURES 

In implementing Procedure 1, a method which allows the user the 
capability to label certain MSS data points corresponding to 
pixels (known as dots) was added to the system by the processor 
DOTDATA. The main function of this processor is to output a 
file containing the dots of interest. This file is an interface 
for three processors, ISOCLS, LABEL, and DISPLAY. 

The dots are defined by field cards. Any subset of the 209 pos- 
sible grid points may be selected by the user. Optionally, the 
dots may be labeled at the initial creation of the file or in 
the LABEL processor. All the categories of interest do not have 
to be defined in this processor. 

If a dot is to be retyped, the file must be re-created by execut- 
ing this processor again. 

The files contain both type 1 and type 2 dots. Type 1 dots 
(starting and labeling dots) are written on one file; type 2 
dots (bias correction dots) are written on a second file. 

By an OPTION control card, the user may request that the spatial 
and spectral information relating to each dot on the file be 
printed on the line printer. 

17.2 INPUT FILES 

This processor requires an MSS data tape (DATAPE) . The tape 
assignment defaults to logical unit 11; but by input of the 
DATAFILE control card, the user may assign any available logi- 
cal unit. (See section 3.2 for further information on format.) 
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17.3 OUTPUT FILES 


DOTUNT, a multifile unformatted FORTRAN written tape, is output 
as an interface to the processors ISOCLS, LABEL, and DISPLAY. 

The default unit for DOTUNT is logical unit 19. (See appendix H 
for format of the tape.) The logical unit and file number can 
be controlled by the DOTUNT control card. 

17 * 4 SCRATCH FILES 

The DOTDATA processor does not require an additional scratch file. 
17.5 CARD INPUT 

The processor keyword is left justified, beginning in column 1: 

$ DOTDATA. This card directs the system monitor volume to select 
the DOTDATA processor and causes all the routines used by it to 
be loaded into the system. 

17.5.1 SYSTEM CARD FILES 

The DOTDATA processor does not use any special input files. 

17.5.2 CONTROL CARDS 

Control cards allow the user to input various options. These 
cards are identified by a keyword left justified in columns 1 
through 10 of the card, with the parameter values or additional 
keywords in columns 11 through 72 (beginning in any column after 
column 10). These control cards may be in any order, but they 
must be the first cards after the processor card $HIST. Table 17-1 
lists all available options, along with their default values. 

17.5.3 FIELD DEFINITIONS 

The user defines the grid points to extract from the MSS data 
tape by the field card. The definition and order of the field 
card(s) determines the position of the dots on the dot data 



file, DOTUNT. The analyst will need to knew the position of the 
dots for defining starting vectors in ISOCLS and for labeling or 
relabeling the dots in LABEL. 

At the time of defining the fields, the type for each dot is 
defined by the TYPE card. By option, the analyst may label each 
dot by a CLASSNAME card. If this card is omitted, the unlabeled 
dots must be labeled by the control card DOTLABEL or excluded 
from the set by the control card EXCLUDE in the labeling proces- 
sor , LABEL . 

An example of a field data set which is expected by this proces- 
sor follows. 

*END 
TYPE 

CLASSNAME 
LABI 
LAB 2 

CLASSNAME 
LAB 3 
TYPE 

CLASSNAME 
BIAl 

CLASSNAME 
BIA2 
$END 

Two files are written. File 1 contains 38 WHEA dots, followed 
by 10 NONW dots; all of which are type 1 dots. File 2 contains 
19 WHEA dots, followed by 19 NONW dots; all of which are type 2 
dots . 

Note; All names on CLASSNAME cards are read from col. 11 
through 15. 


1 

WHEA (optional) 

(10.10) , (10,20) , (196,10) 

(10.10) , (10,20) , (196,20) 
NONW (optional) 

(10.10) , (10,50) , (100,50) 
2 

WHEA (optional) 

(10.10) , (10,40) , (196,40) 
NONW (optional) 

(10.10) , (10,70) , (196,70) 




* 


If the CLASSNAME cards were omitted, file 1 would contain 48 
unlabeled type 1 dots. File 2 would contain 19 unlabeled type 2 
dots. 

In both cases, the reference number for the dots in file 1 
defined by LABI field card is 1 through 19, the LAB2 field 
card reference number is 20 through 38, and LAB3 field card is 
39 through 48. The reference number for the dots in file 2, 
defined by BIA1 field card, is 1 through 19, and for BIA2 
field card, 20 through 38. 

If the LACIE dot input option is invoked, there are no TYPE or 
CLASSNAME card images . Each input card image has the form 

A\ 

B I 

: | ^]_ > n 2 / • • • / 

z ) 

DOT begins in column 1. The dot type (1 or 2) appears in 
column 5. A one-character category name appears in column 7. 

17.6 CARD OUTPUT 

DOTDATA does not produce any card decks. 

17.7 RESTRICTIONS 

System restrictions apply to DOTDATA. 

17.8 D IAGNOSTIC MESSAGES 
17.8.1 SUBROUTINE DOTS 

Message 

**** NOTE - TOTVEC WAS GREATER 
THAN 250, THEREFORE TOTVEC WAS 
SET TO 250 ****. 


Explanation 

Total number of dots allowable 
is 250 of Type 1 and 250 of 
Type 2 . 




17.8.2 SUBROUTINE SET13" 



Check parameter field. 

Check parameters . 

Check format and parameters . 




TABLE 17-1.- DOTDATA -PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

(a) 

Parameter and 
default values 

(b) 

Function 

CHANNEL 

DATA=n 1 ,n 2 , • • • ,n 3Q 
Default: None 

Integer numbers separated by 
commas referring to the chan- 
nels on the MSS data tape. 

DATAFILE 

UNIT=n,FILE=m 
Default: n=ll ,m=l 

n is the Fortran unit number 
assigned to the MSS data tape; 
m is the file number of the 
data to process. 

DOTFIL 

OUTPUT/UNIT=n,FILE=m 
Default: n=19 ,m=l 

n is the Fortran unit number 
assigned to the dot data file 
output by this processor; m is 
the number of the file to 
output . 

OPTION 

PRINT 

Default: no line 

printer output. 

Initiates the printing of the 
dot data file information. 

OPTION 

LAC IE 

Enables user to input dots 
according to the LACIE format. 

HED1 

Any 60 characters 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

First line of the heading on 
line printer output. 

HED2 

Any 60 characters 
Default: HOUSTON, 

TEXAS 

Second line of the heading on 
line printer output. 


a The keyword must begin in column 1. 

^The parameter data can extend from column 11 to column 72. 
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TABLE 17-1.— Concluded. 


Keyword 

Parameter and 
default values 


(a) 

(b) 

Function 

DATE 

Any 12 characters 
Default: Present 
date 

Date in the heading on line 
printer output. 

COMMENT 

Any 60 characters 
Default: None 

Comment printed with heading 
on line printer output. 

*END 

Blank 

Indicates the end of the con- 
trol card inputs. 

$END 

Blank 

Indicates the end of all the 
card inputs. 


a The keyword must begin in column 1. 

^The parameter data can extend from column 11 to column 72. 




$DOTDATA 



Figure 17-1.— Functional flow chart for the DOT DAT A processor. 




18. AUTOMATIC CLUSTER LABELING PROCESSOR - LABEL 


To aid the analyst in supervising the labeling of the statistics 
obtained from the clustering processor ISOCLS, a new technique 
for labeling the statistics was implemented. 

Two procedures for labeling the statistics are provided. Option- 
ally, the analyst may select either the k-nearest-neighbor proce- 
dure or the all-of-a-kind procedure (fig. 18-1) . 

The labels in the dot data file, DOTUNT, and/or previously labeled 
statistics file, SAVTAP, may be relabeled by control card input 
(table 18-1) and the updated file(s) output to tape. Optionally, 

(1) a conditional or mixed cluster map may be output to tape; 

(2) an unconditional cluster map may be output in the format 
acceptable by the DISPLAY processor; (3) the spatial and spectral 
information of the relabeled DOTUNT may be output to the line 
printer; and (4) the statistics of the SAVTAP file may be output 
to the line printer. 


18.1 PROCEDURES 

A distance table containing the L^ or distance of all the 
type 1 dots and the mean of the clusters is computed. 


L 


1 




(18-1) 


L 


2 




where 

- ith element of the dot vector 
u^ = ith element of the mean vector 
n = number of channels 


(18-2) 


U-T 


Using the table as input to the k-nearest-neighbor procedure or 
all-of-a-kind procedure, the statistics generated during cluster 
ing are labeled. 

For the k-nearest-neighbor procedure, all the labels of the 
k-nearest labeling dots to a given cluster are polled. The 
label with the majority of the dots labels the cluster. If 
a tie occurs, then k-1 dots are considered. 

E For the all-of-kind procedure, all of the labeling dots within a 

cluster are polled. If all the dots are of one category, the 
cluster labels that category. If the cluster contains dots for 
more than one category, the label with the majority of the dots 
labels the cluster. If there are no labeling dots within a 
cluster, the labeling defaults to the k-nearest-neighbor 
procedure . 

Optionally, a conditional cluster map may be output. A cluster 
is tagged as conditional if the distance between the nearest 
identically labeled labeling dot and mean of the cluster is 
greater than the analyst input threshold value t. 

Optionally, a mixed cluster map may be output. A cluster is 
tagged as mixed if the labeling dots within a cluster are of 
more than one category. 

Optionally, an unconditional cluster map may be output in the 
format acceptable to the DISPLAY processor. Information used in 
the thresholding procedure in DISPLAY is dummied. If threshold- 
ing of the clustered data is desired, it can be performed by 
exercising the conditional map option in this processor. 
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18.2 INPUT FILES 


Optionally, the three files listed below may be input. 

a. Statistics file (SAVTAP) from ISOCLS or STAT processor 

b. Cluster pap file (MAPUNT) from ISOCLS processor 

c. Dot data file (DOTUNT) from DOTDATA processor 
For format of the files, see sections 4.1 and 5.1. 

18.3 OUTPUT FILES 

Optionally, the following files are output. 

a. Labeled statistics file (SAVTAP) by using a procedure (See 
section 4.1 for format of file.) 

b. Labeled statistics file (SAVTAP) by control card input 

c. Relabeled dot data file (DOTUNT) 

d. A conditional cluster map (See section 5.1 for format of 
file. ) 

e. A mixed cluster map 

f. An unconditional cluster map in the format acceptable by the 
DISPLAY processor (See appendix C for format of file.) 


A line printer summary of the following is output. 

a. Summary of selected options 

b. Table of L. or L» distances 

X X 

c. Summary of the labeling dots within a cluster for the all- 
of-a-kind procedure 

d. Summary of the labeling dots for the k-nearest. dots to a 
cluster for the k-nearest-neighbor procedure 
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e. Spatial and spectral information of relabeled DOTUNT file 

f. Mean and covariances of labeled or relabeled statistics 

18.4 SCRATCH FILES 

LABEL does not require an additional scratch file. 

18.5 CARD INPUT 

Formats for all system card input are defined in section 3.1. 

18.5.1 PROCESSOR CARD 

The processor keyword is left justified starting in column 1: 

$ LABEL (Loads into the system all the routines needed for execut- 
ing this processor.) 

18.5.2 SYSTEM FILES 

The LABEL processor does not use any special input decks. 

18.5.3 CONTROL CARDS 

Control cards allow the user to input various options . These 
cards are identified by a keyword, left justified in columns 1 
through 10 of the card, with the parameter values or additional 
keywords in columns 11 through 72 (beginning in any column after 
column 10) . These control cards may be in any order, but they 
must be the first cards after the processor card LABEL. Table 18-1 
lists all available options, along with their default values. 


18.5.4 FIELD DEFINITIONS 


If the all-of-a-kind procedure is selected, a conditional or 
mixed cluster map is output, or a DISPLAY interface tape is out- 
put, a MAPUNT tape must be input and a field card defining the 
area of the unconditional cluster map (input MAPUNT) must also 
be input. The vertices must reflect the sample number and line 
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number of the MSS data tape used to create the unconditional 
cluster map. That is, the field card must be identical to 
the field card input to ISOCLS. 

If a procedure is not to be executed, a MAPUNT file is not out 
put, or a MAPTAP file is not output, a field card is not input 

Example of the data set: 

• MAPUNT tape is being input. 

★END 

FIELD (1,1) , (1,1) , (196,1) , (196,117) , (1,117) 

$END 

• MAPUNT tape is not being input. 

★END 

$END 

18.6 CARD OUTPUT 

This processor does not produce cards. 

18 . 7 RESTRICTIONS 

General system restrictions apply to LABEL. 

18 . 8 DIAGNOSTIC MESSAGES 
18.8.1 SUBROUTINE ALLKIN 

Message 

a. LABELING BY ALL-OF-A-KIND 
PROCEDURE . 

b. ** DEFAULTING TO K-NEAREST 
NEIGHBOR PROCEDURE ** • 

C. A TIE OCCURRED. THE FOLLOWING 
DOTS WERE DISCARDED. 


Supervisory Messages. 


18.8.2 SUBROUTINE FILEKD 


Message 

a. NOT ENOUGH CORE TO STORE 
DOTFILE. 

b. NOT ENOUGH CORE OR DISK 
SPACE OF CLUSTER MAP INFO. 

18.8.3 SUBROUTINE KNEAR 

Message 

a. LABELING BY XXX NEAREST > 

NEIGHBOR PROCEDURE. ^ 

b. A TIE OCCURRED. THE 

FOLLOWING DOT (S ) WERE I 

DISCARDED. , 

18.8.4 SUBROUTINE LABLR 

Message 

CATEGORIES HAVE NOT BEEN 

DEFINED. 

18.8.5 SUBROUTINE MANORD 

Message 

a. ERROR IN INPUT OF CLASS 
NAMES. NAMES ON 3 TAT FILE 
ARE: 

b. NAMES INPUT ARE: 


Explanation 
Self-explanatory . 

Self-explanatory . 


Explanation 


Supervisory Messages. 


Explanation 

Check spelling and keywords 
on CATEGORY control card. 


Explanation 
Self-explanatory . 


Self-explanatory . 
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18.8.6 SUBROUTINE SET14 


a. INVALID CONTROL CARD - 
IGNORED. 

b. ERROR ON CHANNELS CARD. 

c. ERROR ON DATAFILE CARD. 

d. ERROR ON MAPFIL CARD. 

e. ERROR ON MAPFIL CARD. 

f. ERROR ON OPTION CARD. 

g. ERROR ON STATFILE CARD. 

h. ERROR ON PROCEDURE CARD. 

i. ERROR ON MAPTAP CARD. 

j. NO. OF STAT CHANNEL AND 
DOT DATA CHANNELS MUST 
BE EQUAL. 

k. A LABELING PROCEDURE MAY 
NOT BE CHOSEN WHEN UPDATING 
THE DOTUNT OR SAVTAP FILES. 

l. USER HAS NOT INPUT ONE OF 
THE REQUIRED FILES: 

SAVTAP MAPUNT OR 
DOTUNT . 

18.8.7 SUBROUTINE STOMA 15 
Message 

NOT ENOUGH DISK SPACE TO STORE 

DAS TAPE DATA. 


Explanation 

Check spelling of keyword. 

Check format and parameters. 
Check format and parameters. 
Check format and parameters . 
Check format and parameters. 
Check parameter . 

Check format and parameters. 
Check spelling and parameter. 
Check format and parameters. 
Self-explanatory . 

You may want to go through 
$LABEL again after FILES 
have been updated. 

Self-explanatory . 


Explanation 

A DAS tape output unit should 
be assigned. 
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TABLE 18-1.- LABEL PROCESSOR OPTIONS AND CONTROL CARDS 


Keyword 

Parameter and 
default values 


(a) 

(b) 

Function 

CHANNEL 

STAT=n lf n 2 , • • • ,n 3Q 
DATA=m 1 ,m 2 , • • • ,m 3 g 
Default: 

n^ » all channels on 
SAVTAP file 
m^ = all channels on 
DOTUNT file 

n^ and hk are integer numbers 
separated by commas referring 
to the channels on the SAVTAP 
file and the DOTUNT file, 
respectively . 

DOTFILE 

INPUT/FILE=m , UNIT=n 
Default: None 

Defines the unit and file for 
the input dot data file, 
DOTUNT . 

n is the Fortran unit number 
assigned to the input DOTUNT 
file. 

m is the number of the input 
file to process. 

DOTFILE 

OUTPUT/FILE=m, UNIT=n 
Default: None 

Defines the unit and file to 
which the relabeled D01UNT 
file is output, 
n is the Fortran unit number 
assigned to the output DOTUNT 
file . 

m is the number of the file 
to output. 

STATFILE 

INPUT/FILE=m , UNIT=n 
Default: None 

Defines the unit and file for 
the input SAVTAP file. 


a The keyword must be left justified in columns 1 through 10. 

^The parameter and default values are in card columns 11 through 
72 (beginning in any column past 10) . 


TABLE 18-1.— Continued 


Keyword 


STATFILE 


MAPTAP 


MAPFILE 


Parameter and 
default values 


OUTPUT/FILE=m , UNIT=n 
Default: None 


OUTPUT/FILE=m,UNIT=n 
Default: No DISPLAY 

interface tape will 
be output. 


INPUT/FILE=m , UNIT=n 
Default: m=l,n=16 


Function 

n is the Fortran unit number 
assigned to the input SAVTAP 
file . 

m is the number of the file 
to process. 

Defines the unit and file to 
which the labeled/relabeled 
SAVTAP file is output, 
n is the Fortran unit number 
assigned to the output SAVTAP 
file . 

m is the number of the file 
to output. 

Defines the unit and file to 
which the unconditional clus- 
ter map, MAPTAP, is output, 
n is the Fortran unit number 
assigned to the output MAPTAP 
file . 

If executing back to back with 
DISPLAY, n must be assigned to 
unit 2 . mis the number of 
the file to output. 

Defines the unit and file for 
the input unconditional clus- 
ter map, MAPUNT, that is out- 
put by ISOCLS during the 
clustering process. 


J 


TABLE 18-1.— Continued. 


Keyword 


MAPFILE 


DOTLABEL 


c If both 
on file 


Parameter and 
default values 


OUTPUT/FILE=m , UNIT=n 
Default: n=16,m=l 


Category name, 

n l ,n 2 ' * * ’ ' n 250 
Default: None 


Function 

n=16 is the Fortran unit 
number assigned to the input 
MAPUNT file. (n must be 
assigned to unit 16 if exe- 
cuting back to back with 
ISOCLS) . m is the number 
of the file to process. 

Defines the unit and file to 
which the conditional and/or 
mixed cluster map is output, 
n is the Fortran unit number 
assigned to the output MAPUNT 
file . 

m is the number of the file 
to output. 0 

The DOTUNT file is labeled 
or relabeled by this card. 
Category name is the label 
the analyst is assigning to 
the dots n^ , j=l. The 
category name may be composed 
of a maximum of 6 characters, 
n^ are integer numbers sepa- 
rated by commas referring to 
the position of the dot on 
the DOTUNT file. 


type of maps are output, the conditional map is output 
m; the mixed map, on file m+1. 
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TABLE 18-1.— Continued 


Keyword 

STALABEL 


DISTANCE 

DISTANCE 

OPTION 

OPTION 

THRESHOLD 


Parameter and 

default values Function 


Class name/ n^, 

n 2 / • • * ' n 250 
Default: None 


The SAVTAP file may be manu- 
ally relabeled by this card, 
n j ( j=l,2, • • • ,i) are the 
number of the subclasses on 
the SAVTAP that are to be 
regrouped into another class. 
Class name is the name of the 
class to which the subclasses 
n^ are to be reassigned. 

Class name must match a name 
on the SAVTAP file. 


LI 

Default: L^ distance 


L2 

Default: L^ distance 


COND 

Default: 

MIXED 

Default: 


None 


None 


The distance is used in 
computing the distance between 
the means of the cluster and 
the labeling dots. 

The L 2 distance is used in 
computing the distance between 
the means of the cluster and 
the labeling dots . 

A conditional cluster map is 
output. 

A mixed cluster map is output. 


T T is the threshold parameter 

Default: T=25.0 used in creating the condi- 

tional cluster map. T is a 
floating-point number. 


JS-MT' 

Z37 


TABLE 18-1.- Continued. 


Keyword 

NEAREST 


PROCED 


MODULE 


EXCLUDE 


SUNANG 


Parameter and 

default values Function 


K K is the number of dots to be 

Default: K=1 used in the k-nearest~neighbor 

procedure. K is an integer 
number. k<250. 


NAME NAME is an alpha word. 
Default: N=K-NEAREST NAMH=K-NEAREST (Use the k- 

nearest-neighbor procedure.) 
NAME=ALL (Use the all-of-a- 
kind procedure . ) 

NAME=MANUAL (Use the manual 
procedure of relabeling the 
DOTUNT or SAVTAP file. 


Blank Initiates the input of the 

module STAT card file. The 
deck must immediately follow 
this card. 


n l' n 2 ' * ’ ’ ' n 25o 
Default: All dots 

on the DOTUNT are 

used. 

m^ 1 * * * 

Default: No Sun 

angle correction is 
to be applied. 


n^ are integers numbers refer- 
ring to the dots on the DOTFIL 
that are to be excluded in all 
calculations (i.e., dots 
within a DO/DU area) . 

irij are integer Sun angle 
numbers used in computing the 
L^ or L 2 distances. A Sun 
angle must be input for each 
acquisition of interest. An 
acquisition is assumed to be 
a 4 channel pass. 


TABLE 18-1.— Continued 


Keyword 

Parameter and 
default values 

Function 



Example: If the distance is 

computed using 16 channels, 

4 Sun angles (m^ ,m 2 ,m^ ,m^) 
must be input. 

SUNANG 

FILE 

Default: No Sun 

angle correction is 
applied. 

Sun angles will be extracted 
from the DOTUNT file. 

OPTION 

STATS 

Default: Statistics 

not printed. 

Means and covariances for 
labeled or relabeled statis- 
tics on SAVTAP file are 
printed. 

OPTION 

DOTS 

Default: Relabeled 

DOTFIL is not 
printed. 

Spatial and spectral infor- 
mation of relabeled DOTUNT 
file is printed. 

HED1 

Any 60 characters 
Default: LYNDON B. 

JOHNSON SPACE CENTER 

First line of the heading on 
line printer output. 

HED2 

Any 60 characters 
Default: HOUSTON, 
TEXAS 

Second line of the heading on 
line printer output. 

DATE 

Any 12 characters 
Default: Present 
date 

Date in the heading on line 
printer output. 

COMMENT 

Any 60 characters 
Default: None 

Comment printed with heading 
on line printer output. 
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TABLE 18-1.— Concluded. 


Keyword 

Parameter and 
default values 

Function 


*END 

Blank 

Indicates the end of 

the con 



trol card inputs. 


Send 

Blank 

Indicates the end of 

all the 



card inputs. 
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Figure 18-1.- Functional flow chart for the LABEL processor 





19. SYSTEM RESTRICTIONS 


The system is limited in every processor to processing no more 
than 30 channels of data. The MSS data tape (DATAPE) may have 
more than 30 channels, but for processing purposes a subset of 
those channels must be selected via the CHANNELS control card. 

A maximum of 60 categories, classes, and subclasses may be 
processed. However, the maximum number of channels and sub- 
classes may not be processed at one time. The arrays within 
the system are dimensioned variably according to user requests. 
The amount of storage available will not accommodate the arrays 
which are dimensioned number of subclasses by number of chan- 
nels, if both maximums are used. Restrictions under the STAT, 
SELECT, and CLASSIFY processors allow the user to compute 
approximately whether or not the numbers of channels and sub- 
classes selected are acceptable. When core storage require- 
ments are exceeded, a diagnostic message is printed, and the 
user must reduce his requirements in order to get a successful 
execution. Restrictions specific to individual processors are 
noted in the documentation for each processor. 

Owing to the virtual storage characteristics of VM/370, arrays 
can be enlarged by the System Maintenance Group at user request. 
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SYSTEM DIAGNOSTIC MESSAGES 
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APPENDIX A 


UTILITY SUBPROGRAMS - DIAGNOSTIC MESSAGES 

The diagnostic messages listed in this section are printed by 
utility subprograms used by more than one processor. Diagnos- 
tics which occur only in a given processor are listed in the 
sections for the individual processor. 

SUBPROGRAM; Message Explanation 


BUFILL ; 

XXXX BYTES EXPECTED. 

XXXX BYTES ON RECORD. 

CLSCHK : 

a. ** CLSCHK** - REQUESTED 
SUBCLASS NO. XXX IS NOT 
AVAILABLE IN INPUT 
STATISTICS — REQUEST 
IGNORED . 


b. ** CLSCHK** - REQUESTED 
SUBCLASS NO. XXX FOR 
GROUP NO. XXX IS NOT 
AVAILABLE IN INPUT 
STATISTICS FILE. 


Self-explanatory . 


Either by the SUBCLASS control 
card or by default, a subclass 
number has been requested to be 
used in classification, which 
is greater than the largest sub- 
class number available in the 
input training subclasses. The 
CLASSIFY processor ignores the 
requested subclass number and 
deletes it as a possibility for 
use in classification. 

A subclass number input by 
either the SUBCLASS or the 
GROUP control card is greater 
than the largest training sub- 
class number available. The 
requested subclass will be 


a** 


SUBPROGRAM; Message 


Explanation 

deleted and ignored in 
classification . 


***** CLSFY/FETCHK 

CHANNEL XXX NOT IN TRAIN- 
ING DATA TRAINING 

DATA CHANNELS ARE 

C 1' C 2' C 3' ’ * * ' C N* 


***** CHANNEL XX IGNORED 
(NOT USED) IN 
CLASSIFICATION. 


A channel requested on the 
CHANNELS control card or in the 
B-matrix input for use in clas- 
sification is not available .in 
the set of channels which was 
used to obtain training subclass 
statistics. The available set of 
channels in the training statis- 
tics is printed out as part of 
the diagnostic message. Mes- 
sages d and e are added to this 
when the B-matrix is not 
involved (d) and when B-matrix 
channels are input (e) . 

If the B-matrix is not being 
input to the CLASSIFY processor 
(i.e., B-matrix channels are not 
involved) , the requested channel 
in the previous message will be 
deleted from the list of chan- 
nels and ignored by the CLASSIFY 
processor. In this case, this 
message is added to the previous 
diagnostic message. 


***** b-MATRIX CHANNELS 
MUST BE EQUAL TO OR A SUB- 
SET OF AVAILABLE TRAINING 

DATA CHANNELS THE 

INPUT B-MATRIX CHANNEL 
SET IS B^ /B 2 / * * * 


If the B-matrix is input to the 
CLASSIFY processor, the B-matrix 
channels become the set which is 
to be used in classification; 
and if one of the B-matrix chan- 
nels is not available in the 


SUBPROGRAM: Message 


CMERR : 

ERROR HAS OCCURRED. 

CRDSTA : 

EXCEEDED CORE LIMITS. REDUCE 
NO. OF TRAINING CLASSES OR 
FEATURES. EXITING FROM CRDSTA. 


FLDINT : 

a. FEATURE NUMBERS XXXXX AND 
ABOVE ARE NOT ON DATA 
TAPE. 

b. FIRST SCAN ON THIS TAPE 

IS NUMBERED XXXXXX. FIELD 
DEFINITION IN ERROR. 

C. NUMBER OF SAMPLES OF PER 
SCAN ON THIS TAPE IS 
XXXXXX. FIELD DEFINITION 
IN ERROR. 

d. THIS TAPE CONTAINS ONLY 
XXXXXX CHANNELS. 


Explanation 

training subclass statistics, 
the processor cannot continue. 

In this case, this will be the 
remainder of the error message. 

Self-explanatory . 

The combination of total number 
of channels, subclasses, and 
training fields must be reduced 
to fit in the internal core 
storage available to the proc- 
essor. Total storage is 10 600 
locations . 

User has requested a channel not 
on MSS DATAPE. 

Self-explanatory . 


Self-explanatory . 


Self-explanatory . 
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SUBPROGRAM; Message 


FSBSFL ; 

FSBSFL ONLY SKIPS FORWARD. 
FSFMFL: 

FSFMFL ONLY SKIPS FORWARD. 
GETST : 

a. ERROR IN POSITIONING 
UNIT XXX TO FILE XXX. 

b. REQUESTED SUBCLASS IS NOT 
ON STAT FILE. STAT FILE 
CONTAINS XXX SUBCLASSES. 

C. CHANNEL NO. XX IS NOT ON 
TRAINING STAT FILE. 
CHANNELS ARE 

, C2 t C2 r * * * i Cjj • 

GRPSCN : 

///// FROM SUBR. GRPSCN 

CLASS XXXXX INCORRECT 

CLASS XXXXX IGNORED. CARD 
BEING SCANNED IS XXXX***XXXX. 


1 



One of the class numbers listed 
on the GROUP control card has 
one or more of the following 
three conditions: (1) not in 

ascending order, (2) greater 
than the largest class number 
allowable (30) , or (3) has 
already been used in another 
GROUP card. The erroneous 
GROUP control card is printed 
as part of the message. The 
processor will delete the 
erroneous class number from 
the list and proceed grouping 
all other listed classes. 
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SUBPROGRAM; Message 


Explanation 


HISTGM : 

TOO MUCH DATA REQUESTED — 

SAMPLE END WAS RESET TO XXXXX. 

I4A1BN : 

EBCDIC TO BINARY INTEGER CON- 
VERSION ERROR AT CHARACTER 

XXXXX OF XXXXX CHARACTER 

FIELD; XXXX***XXXX. 

LABMAN ; 

a. ERROR IN POSITIONING SIG. 
EXTENSION TAPE TO FILE 
XXX. OUTPUT FILE NOT 
WRITTEN . 

b. THE STATISTICS FILE FOR 
XXXX CLASSES AND XXXX 
SUBCLASSES HAS BEEN 
WRITTEN. 

C. THE STATS WERE WRITTEN 
ON FILE XXX. 

d. THE STATS FOR A PARTICULAR 
CLASS OR SUBCLASS SHOULD BE 
REFERRED TO IN LATER RUNS 
BY THE FOLLOWING NUMBERS 
(WHICHEVER APPLICABLE) . 

LAREAD: 

a. ERROR IN FIELD CARD TER- 
MINATING RUN. 


Self-explanatory . 


Self-explanatory. 


Self-explanatory. 


Self-explanatory. 


Self-explanatory . 
Self-explanatory. 


A field description card has an 
incorrect format. All vertices 
must be separated by commas and 
enclosed in parentheses, and 




SUBPROGRAM: Message 


b. INCORRECT FIELD CARD, 
TERMINATING RUN. 

LINERD : 

a. FIELD BOUNDARY FOR THIS 
FIELD DEFINED LEYOND SCOPE 
OF DATA. THIS FLIGHT LINE 
CONTAINS XXXXXX SCAN LINES. 

b. FLDINT MUST BE CALLED TO 
INITIALIZE PARAMETERS FOR 
A NEW FIELD. 

RDDOTS : 

a. CHANNEL XX IS NOT ON 
DOTFIL . CHANNELS ARE 

C 1' C 2' * * ’ /C N ’ 

b. DOT NO. XXX IS NOT ON 
DOTFIL. FILE CONTAINS 
XXX DOTS. 

RDMODK : 

ERROR IN TRYING TO POSITION 
STAT FILE TO FILE XXX IN 
CRDSTA. 


Explanation 

sample and line numbers must be 
integers. The card which caused 
the error is printed out with 
this message. 

Same. 


User has requested scan line not 
on MSS DATAPE. 


For every field input there must 
be a call to FLDINT to reset 
parameters for positioning the 
MSS DATAPE. 

Self- explanatory . 


Self-explanatory. 


An error occurred in position- 
ing the SAVTAP file, and no 
statistics were written. 
Resubmit the run. 


SUBPROGRAM: Message 

REDDAT : 

CHANNEL NO. XX IS NOT A 
TRAINING CHANNEL. XX TRAIN- 
ING CHANNELS ARE C. ,C 0 , • • • ,C„ . 

12 N 

REDSAV : 

a. STAT FILE WAS NOT CREATED. 
EXITING FROM **REDSAV**. 


b. ERROR IN POSITIONING STAT 
FILE TO FILE XXX. EXITING 
FROM REDSAV. 

C. USER HAS REQUESTED XX 

CHANNELS, XX SUBCLASSES, 
AND XX CLASSES. THIS 
COMBINATION OF STATS WILL 
NOT FIT IN CORE. PLEASE 
REDUCE REQUEST. 


Explanation 
Self- explanatory . 


An error occurred in positioning 
the SAVTAP file, and no statis- 
tics were written. Resubmit 
run. 

Same . 


The fixed amount of internal 
core storage available to the 
processor for storing class 
descriptions , number of sub- 
classes in each class, subclass 
descriptions, field information, 
vertices , covariances , means , 
and working area has been 
exceeded. The total amount of 
storage available for the above 
information is 10 600 locations.^" 
Reduce the requested combination. 


The equation for computing the required storage is: STORAGE = 

2 (number of classes) + (number of subclasses) + 4 (number of 
fields) + 2 (total number of vertices for all the fields) + 
(number of subclasses + 1) [(number of channels) (number of 
channels + l)/2] + (number of subclasses) (number of channels). 
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SUBPROGRAM; Message 


Explanation 


SEARCH : 

a. SEARCHING FOR LINE. 

b. RECORDS PER SCAN, XXXXX. 
SCANS PER RECORD, XXXXX. 

C. FOUND IT AFTER XXX TRIES. 

d. FAILED AFTER XXXXX TRIES — 
ABORTING . 

e. SCAN XXXXX IS MISSING — 
USING PREVIOUS SCAN 
INSTEAD . 

TAPHDR: 

a. UNRECOVERABLE ERROR READ- 
ING HEADER RECORD. 

b. A LINE NO. IS LESS THAN 
OR EQUAL ZERO. 

C. LAST SCAN LINE READ XXXXX. 
ISTAT = XXXXX. 

d. INTERNAL DIMENSIONS TOO 
SMALL FOR DATA. NUMBER 

OF CHANNELS ON DATA TAPE = 
XXXXXXX . NUMBER OF 
POINTS/CHANNEL = XXXXXXX. 

e. CHECK THE FOLLOWING POS- 
SIBLE ERRORS. 1. DATA 
TAPE IS NOT IN UNIVERSAL 
OR LARSYS FORMAT. 


Self-explanatory . 
Self-explanatory . 

Self-explanatory . 
Self-explanatory. 

Self-explanatory . 


Error occurred while trying to 
read header record. 

The first line number on the 
data tape is less than or equal 
to zero. 

Self-explanatory. 

The maximum record size of the 
data record exceeded 6800 words. 


The MSS DATAPE must be in 
LARSYS III or Universal format. 


SUBPROGRAM: Message 


f. ONLY ONE OR LESS RECORDS 
PER CHANNEL ACCEPTABLE 
AT THIS TIME. 

g. NO. OF RECORDS PER DATA 
SET = XXXXX. MUST BE 
LESS THAN OR EQUAL 15. 

h. NO. OF BITS /PIXEL = XXXXX. 
ONLY 8 BITS ACCEPTABLE AT 
THIS TIME. 


i. DATA ORDER INDICATOR = 
XXXXX. DATA MUST BE 
ORDERED BY PIXEL. 

WRTHED : 

NUMBER OF SAMPLES WAS RESET 
TO 2998. 


Explanation 

All of the samples of one chan- 
nel must be contained within 
one record. 

One data set cannot contain 
more than 15 records. 

According to the header record, 
the samples on the MSS DATAPE 
do not equal eight bits. It is 
assumed that the header record 
is in error, and execution 
continues . 

Information from header records 
indicates data are not ordered 
samples by channels. 

Number of samples per line 
cannot be more than 2998. 
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APPENDIX B 

LARSYS III FORMAT FOR AN MSS DATA 
STORAGE TAPE 
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APPENDIX B 

LARSYS III FORMAT FOR AN MSS DATA 
STORAGE TAPE 

This is the third version of the format used in Purdue's LARSYS. 
The only difference in the second and third version of the for- 
mat is one word in the header record. That difference is 
transparent to this system. 

There are four types of (physical) records on the Multi- 
spectral Scanner Data Storage Tapes, They are: 


1. ID record - 200 full words fixed length 

2. Data record - variable length 

3. End-of-Tape records - 200 full words fixed length 

4. End-of-File records - IBM Standard 


A Multispectral Scanner Data Storage Tape contains one or 
more data runs consisting of an ID record, several data 
records and an End-of-File record. After the last data 
record on the tape, an End-of-Tape record and two End-of-File 
records are written on the tape. 


For the purposes of th*,: presentation, a 'word* is defined 
to be 32 bits and a byte to be 8 bits . Further details 
regarding the physical records follows: 




1. ID record (200 full words fixed length) 


ID (11 

ID (2) 
ID (3) 


ID (4 ) 


ID (5) 
ID ( 6 ) 
ID (7- 


FORMAT DESCRIPTION 


10 ) 


Integer 

Integer 

Integer 


Integer 


Integer 

Integer 

Alpha- 

numeric 


LARS Tape Number (e.g., 1, 17, 

r 

102, etc.) 

File number on this tape 
Run number (8 digits aabbbbcc) 
aa - last 2 digits of the Year 
data was acquired 
bbbb - running serial number for 
the year data was taken 
cc - uniqueness digits for runs 
which would otherwise have 
the same run number 
Continuation Code 
ID (4) = 0 means the first line of 
data follows this ID 
record 

ID (4) = X means that the data 
following this ID 
record is a continuation 
of a flight line started on 
tape X 

Number of Data Channels (Spectral 
bands) on tape (30 maximum) 

Number of Data Samples per channel 
per scan line 

Flightline Identification (16 
characters) 


,/ 


(4A4) 



FORMAT 

DESCRIPTION 

ID (11) 

Integer 

Month data was taken 

ID (12) 

Integer 

Day data was taken 

ID (13) 

Integer 

Year data was taken 

ID (14) 

Alpha- 

numeric 

C1A4) 

Time data was taken 

ID (15) 

Integer 

Altitude of aircraft 

ID (16) 

Integer 

Ground heading of aircraft 

ID (17-19) 

Alpha- 

Date data run was generated on 


numeric 

(3A4) 

this tape (12 characters) 

ID (20-50) 

Integer 

All zero (to be defined later) 

ID (51) 

Real 

Lower limits in micrometers of 
first spectral band on tape 

ID (52) 

Real 

Upper limits in micrometers of 
first spectral band on tape 

ID (53) 

Real 

The suggested value of "CO" 
calibration pulse 

ID (54) 

Real 

The suggested value of "Cl" 
calibration pulse. 

ID (55) 

Real 

The suggested value of "C2" 
calibration pulse. 

ID (56-200) 

Real 

Repeat of ID (51-55) for ID (5) 
channels in order of appearance 
in Data Records. 

ID (51-200) 

Real 

= 0.0 if Data Channels do not 


exist. 



Data Record: 


Each data record will contain one scan line of data from 
ID (5) (see ID Record) channels. The first half word 
(2 bytes) of the record will be the record number. The 
second half word (2 bytes) will be the roll paramater 
which is a number indicating relative roll of the 
aircraft for this scan line of data. If the roll para- 
meter is -32,767, the data for the given line does not 
exist. If the roll parameter has not been calculated, it 
will be set to 32,767. The fifth byte will be the first 
data sample from the first channel. The data samples are 
ordered: Channel^ Sample^ - Sample N ; Channel^ Sample^- 

Sample N ; and so on through ID (5) , Channels and ID (6) 
data samples per channel. A data record (scanline) will be 
ID ( 5 ) * ID ( 6 ) +4 bytes long . 

The data from each channel will be from the field of view 
of the scanner except the last six bytes. The last six 
are calibration data in the order of appearance: 


1. 

C 0 

"O'* or dark 

level 


2. 

VC 0 

Variance of 

c o 


3. 

Cl 

Calibration 

source 

C 

4. 

VC 1 

Variance of 

C 1 


5. 

C 2 

Calibration 

source 

C 

6. 

VC 2 

Variance of 

C 2 



where = Calibration value i and VC^ = calculated variance 
of calibration value i. 
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On good data records all 8-bit data and calibration values will 
be integers in the range of 0 uo 255 with no sign included 
in the eight bits. A sample data value of 0 to 255 is the 
result of the 8-bit analog-to-digital conversion which pro- 
duces the multispectral scanner data tape. With 8-bit A/D 
conversion, data values range between 0 to 255 with 0 
usually indicating low relative irradiance and 255 usually 
indicating high relative irradiance. 


l',nd-of-Tape Record : 


The End-of-Tape Record is very similar to the ID Record with 
200 full words in the following format. 


WORD 


FORMAT DESCRIPTION 


ID (1) 

Integer 

ID (2) 

Integer 

ID ( 3) 

Integer 

ID (4) 

Integer 


ID (5-50) Integer 

ID (51-200) Real 


LARS tape number 
File number on this tape 
Set equal to zero 
Continuation Code 
ID (4) = 0 means end of data 
ID (4) = X means data in previous 
file is continued on 
tape X 

All zero (may be defined later) 
0.0 (may be defined later) 
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APPENDIX C 

UNIVERSAL FORMAT FOR AN MSS DATA TAPE 

This is an adaptation of the Universal Data Tape Format as 
defined in the Earth Resources Data Format Control Book, Vol. 1 
(ref. 12, section 7). 


Ground Rules 


The ground rules for the UNIVERSAL format as accepted by 
all the processors within this system are as follows: 


• 8 bits = 1 byte 

• The header record is the first record on a tape. 

• The header record is fixed length equal to 3060 bytes. 

• Data following the header will be arranged by data sets, 
where a data set is defined as the ancillary data and 
all of the video data for one scan line for all active 
channels . 

• Data sets will be recorded in variable length physical 
records, not to exceed 3000 bytes of information per 
record. Note, since 3000 bytes is not compatible with the 
word lengths of all computers, the computer generating 
the tape will add a sufficient number of fill zeros to 
the end of the data to make the record length divisible 
by 32, 36, 48, and 60 bits (180 bytes). Therefore, it 

is possible to have a max physical record length of 3060 
bytes, but under no condition will data exceed 3000 bytes. 
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• Data sets will be packed into consecutive physical records 
of equal length. Under no condition will a data set begin 
in the middle of a physical record unless the data set 
can be completed in that record. If two or more records 
are needed for the data set, the data set will be divided 
but under no condition will the data for a video channel 
begin in the middle of a pnysical record unless the data 
for that video channel can be completed in that record. 
Consequently, data sets which are lengthy will be divided 
so that the ancillary block and video data from an inte- 
gral number of channels will be in one record and remain- 
ing video data will follow in succeeding records with an 
integral number of channels per record. Fill zeros will 
be supplied at the ends of the records as required to 
satisfy the equal length contraint noted above. 

• All data in the header record and ancillary blocks will 
be in binary. 


• The tape format will be as follows : 

Header Record 
IRG* 


Ancillary Block 
Video Block 


Data Set 


IRG** 


Ancillary Block 
Video Block 


Data Set 


IRG 


EOF 


* IRG = Inter Record Gap - always follows the header record. 
** An IRG may appear between the ancillary block and the video 
block so that the recording of a data set requires more than 
one physical record; or a physical record may contain two or 
more data sets, not separated by any IRG. See ground rules 

above and data set description following for criteria deter- 
mining the placement of IRG's. 
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Reader Record 


Although the header record is 3060 bytes in length, only a 
portion of the information is pertinent to the system at this 
time. A general description of the data that is unpacked 
by the TAPHDR routine is as follows: 


NO. OF 

BYTE NO. DESCRIPTION BYTES 

89 Processing flag 1 

0 = Raw Data 

1 = Processed data from computing system 

90 No. of channels in this job 1 

91 No. of bits in a picture element 1 

(Must be 8 at this time) 

92-93 Address of start of video data 2 


gives location of start of video within 
scan. 


96-97 

100-101 

102 


No. of video elements per scan within a 2 

single channel. 

Physical record size in bytes 2 

This number must be a multiple jf 180 bytes. 

No. of channels per physical record 1 

This field refers to the second and sub- 
sequent records within the recording of 
a data set. Bytes 1785-1786 give the 
number of channels of data in the first 
record of a data set. 

If no. elements per channel greater than 
3Kj this field rill equal 0 . 


C-T 

"? 6 " X 


BYTE NO 


DESCRIPTION 


NO. OF 
BYTES 


103 

104 

105-106 

107 

103-109 

110-111 

1778 

1785-1786 

1787-1788 


No. of physical records per scan 1 

per channel. This field is used only 
when the no. of elements per channel 
is greater than 3K. Otherwise it is 
equal to 0 . 

No. of records to make a complete data set 1 

Length of ancillary block in bytes 2 

Data Order Indicator 1 

0 = Video ordered by channel. 

1 = Video ordered by pixel 

Start Pixel No. 2 

Number of the first pixel per scan on 


this tape referenced to original image. 
The first pixel in the original image 
is pixel number one. 


Stop Pixel No. 2 

Number of the last pixel per scan on 
this tape referenced to original image. 

Number of Data Sets per Physical Record 1 

Number of channels in the first physical 2 

record of the data set 

Total number of bytes per scan per channel 2 


Data Sets 
Ancillary Block 


The first block of a data set is the ancillary block. 

The length of the ancillary block is variable, with the 
number of bytes given in the header record. 


• The first word (2 bytes) of every record is a counter 
givxng the number of the physical record within the video 
data set. This is primarily intended for use in data sets 
that are greater than 3000 bytes long and therefore require 
more than one physical record for recording. This word 
will always equal "1" for the first record of a data set. 

• Bytes 3 through 6 will contain the current GMT at the start 
of this data set recorded in tenths of milliseconds. 

• Bytes 7 through 70 will indicate channel status for this 
scan, one byte per channel, where the LSB = 0 indicates the 
channel is sync, and the LSB = 1 indicates the channel not 
in sync. 

• 71-72 contain the scan line number. This will be an arbi- 
trary but sequential count for each scan line that appears 
in the data run. 

• Bytes 73 through N will be dependent on whether this job 
contains raw or processed data. (See byte 89 in the header 
record.) The lue of N will be given in bytes 105 and 
106 in the header recox^d and will always be equal to or 
greater than 70. 

• If this job contains raw data, bytes 73 through N will 
contain the housekeeping data channel from the sensor, 
if one is available. 

• A job containing processed data will, in addition to the 
70 bytes of ancillary data already described, contain, 
at a minimum, the following pieces of information: 

• Latitude of the aircraft or of the center of the image 
from EREP or satellite in binary. 

• Longitude of the aircraft or of the center of the imace 
from EREP or satellite in binary. 
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• Altitude in meters recorded in binary. 

• Heading i:a tenths of a degree. 

• Ground speed in meters per second. 

• Roll - Defined in specific formats, following. 

• Pitch - Defined in specific formats, following. 

• Yaw - Defined in specific formats, following. 

• Sun angle. 

The specific formats for each sensor (following in this 
section) shall provide where this data will appear in 
the format. 

Other parameters may be added, if reguired, with the length 
of the ancillary block given in the header. 

Video Data 

• Following the ancillary block in each data set will be the 
video data from all of the active channels for one scan. 

The video data from all of the active channels for one 
scan will comprise a video block. 

• Video blocks within a data run will always contain the 
same number of video channels. 

• Each video block will be the same number of bytes in length. 
If video data is not available to fill a block, fill zeros 
will be added to make it the same length as preceding video 
blocks . 

• Video data less than 8 bits per pixel will be packed, right 
justified, in an 8-bit byte with zeros added to the left. 




• Video data greater than 8 bits per pixel will be packed, 
right justified, in as many 8-bit bytes as necessary to 
hold the pixel, with zeros added to the left. 

• If this tape contains raw data, the PCM sync words asso- 
ciated with the video data, if any, will be included with 
the video data on this tape. If this tape contains pro- 
cessed data, no sync words will be present. 

• Calibration data that is associated with each scan within 
each channel will be included, if this tape contains raw 
imagery data, in the came sequence as it appears in the 
data stream on the flight tape. If this tape contains 
processed imagery data, the appearance of the calibration 
data will depend on the specific sensor requirements and 
will be specified in the respective format following in 
this document. 

• The combined length of the ancillary blocx and the video 
block will determine the relationship between data sets 
and physical records. Some data runs may contain data sets 
which are so small more than one can be packed into one 
physical record. Others may contain data sets which will 
require a whole physical record for each. Still others 
may contain data sets which are so long that each data 

set will require two or more physical records. 

• Data sets will be packed in physical records depending on 
the length of the data set. The ancillary block will always 
appear in the first physical record per data set. Follow- 
ing the ancillary block, as many complete channels in this 
data set will be recorded as will fit in up to 3000 bytes. 

If the data set is too long to be recorded in one physical 
record, the second and subsequent records will begin with 
the next active channel in the data set. 
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• If a video block is divided between more than one record 
for recording, the number of data channels in the first 
record may vary from the number of channels in the second 
and successive records; however, the number of channels in 
all records following the first per data set will always 
be the same. The number of channels in che first record and 

r 

the number in successive records will be given in the header 
record. In records following the first, if video data is 
not available so as to allow all records to contain the 
same number of channels, fill zeros will be added in lieu 
of the video data in order to make all records the same 
length. In addition, fill zeros will be added to either 
the first record or all of the successive records, depend- 
ing on whj.ch is shorter, so as to make all of the records 
the same length. 


• The only arrangement of pixels within a scan of data will 


be by channel. The Universal format* will be as follows: 


I 


LINE 1 


CHANNEL 1 


CHANNEL 2 


PIXEL 1 
PIXEL 2 


PIXEL N 
/ PIXEL 1 
PIXEL 2 


PIXEL N 


CHANNEL M 


PIXEL 1 
% 

PIXEL N 


* If this tape contains raw imagery data, the 
any, that are associated with the data on the 
included with the data. 

C-8 


PCM sync words, if 
flight taoe will be 


LINE 2 





CHANNEL 1 


CHANNEL M 


J ^TXEL 1 

) • 

| : 

l PIXEL N 


PIXEL 1 


PIXEL N 
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APPENDIX D 


MAPTAP FILE FORMAT 

The file, MAPTAP, is output by the processor CLASSIFY. It 
contains the statistics actually used in classification, the 
training field, category, class, and subclass information, 
and the classified data. 

r 

Each file consisting of the following types of records: 

run header records 
field header record 
data records 
end-of-field record 
end-of-run record 
of- file mark 

All records are written with a nonformatted FORTRAN write 
statement . 


4 - 

'l. - 

Repeated for each ^ N 
classified field 1 

u - 
1 - 

end- 


RUN Header Record 1 


WRITE (MAPTAP) (DATE (I) ,1=1,2) , BMFLG , BMCOMB , BMFEAT , NOCLS 2 , 
N0FLD2 , NOSUB 2 ,N0FET2 ,T0TVT2 , NOCAT , VARS Z 2 , 
(FETVC'2 (I) , 1=1 ,N0FET2) 

FORTRAN NAME 

AND DIMENSION DESCRIPTION 


DATE (2) 

Date classification was performed 

BMFLG 

Flag indicating B-MATRIX was used 
in classification 

BMCOMB 

No. of linear combinations in 
B-MATRIX 


y/s 


FORTRAN NAME 
AND DIMENSION 


DESCRIPTION 


BMFEAT 

NOCLS2 

NOFLD2 

NOSUB 2 

NOFET2 

TOTVT2 

NOCAT 

VARS Z 2 

FETVC2 (NOFET2) 


No. of channels used in computing 
the B-MATRIX 

No. of classes 

No. of training fields 

No. of subclasses 

No. of channels used in classification 
No. of vertices in training fields 
No. of categories 

Size of covariance for each subclass 
Actual channels used in classification 


RUN Header Record 2 


WRITE (MAPTAP) (CATNAM(I) ,I=1,N0CAT1) , (CLSMTX(I) ,I=1,N0CLS2) , 
(SUBNO(I) , 1=1 ,N0CLS2) , (SUBDES(I) ,I=1,N0SUB2) , 

( (FLDMTX (I , J) ,1=1 ,4) ,J=1,N0FLD2) , ( (VFRTEX(I,J) , 
1=1,2) ,J=1,T0TVT2) , (SUBCAT (I) ,I=1,N0SUB2) , 
(CLSVC2 (I) ,I=1,N0SUB2) , (KATNO (I) ,I=1,N0CLS2) , 
(KEPPTS (I) ,I=l,NOSUB2) 


•FORTRAN NAME 

AND DIMENSION DESCRIPTION 


CAT NAM (NOCAT 1) Category names (if available) 

N0CAT1 = no. of categories if 
CATEGORY classifier was applied 
N0CAT1 = no. of classes if STANDARD 
classifier was applied. 

Class names 


CLSMTX (NOCLS 2 ) 


FORTRAN NAME 

AND DIMENSION DESCRIPTION 


SUBNO (NOCLS2) 
SUBDES (NOSUB2) 
FLDMTX ( 4 , NOFLD2 ) 


VERTEX ( 2, TOTVT2) 

SUBCAT (NOSUB 2) 

CLSVC2 (NO SUB 2) 
KATNO (NOCLS2) 
KEPPTS (NOSUB2 ) 


No. of subclasses in each class 
Subclass names 

Training field information 

1 - field name 

2 - Class number field belongs to 

3 - Subclass number field belongs to 

4 - No. of vertices in this field 

Vertices for all the fields; ordered 
(sample , line ) ^ , ( sample , line ) 2 / • • • 
(sample , line) T0TVT2 

Contains the category number to 
which each corresponding subclass 
belongs 

Contains the class number to which 
each corresponding subclass belongs 

Contains the category number to 
which each class belongs 

Contains the total number of train- 
ing field pixels in each subclass 


RUN Header Record 3 

WRITE (MAPTAP) ( (COVMTX (I , J) ,I=1,VARSZ2) ,J=1,N0SUB2) 
( (AVEMTX(I,J) ,I=1,N0FET2) ,J=1,N0SUB2) 


FORTRAN NAME 

AND DIMENSION DESCRIPTION 

COVMTX (VARS Z2 ,N0SUB2) Original or B-transformed covariance 

matrix for each subclass 

AVEMTX (N0FET2 ,N0SUB2) Mean vector for each subclass 




Run Header Record 4 


WRITE (MAPTAP) ( (COVMTX(I,J) ,I=1,VARSZ2) ,J=1,N0SUB2) , 

(CON (I) ,I=1,N0SUB2) , (DET(I) ,I=l,NOSUB2) 

FORTRAN NAME 

AND DIMENSION DESCRIPTION 

COVMTX (VARSZ2 , NOSUB 2) 'Modified' Cholesky Factorization 

of the covariance matrix for each 
subclass 

CON (NOSUB2) Subclass constants 

DET(NOSUB2) Determinant of covariance matrix 

for each subclass 

Field Header Record 

WRITE (MAPTAP] (FLDINF (I) ,1=1,6) , PTS , LINES ,FLDESC ,NC, 

(VERTCS (I) ,1=1, NC) , (VERTCS(I+NC) ,NC=1,NC) 

FORTRAN NAME 

AND DIMENSION DESCRIPTION 

FLDINF (6) Rectangular coordinates surrounding 

the field classified. 

1 - line start 

2 - line stop 

3 - line increment 

4 - sample start 

5 - sample stop 

6 - sample increment 

No. of points in the rectangular 
field defined in FLDINF 



PTS 


FORTRAN NAMES 
AND DIMENSION 

LINES 

FLDESC 

NC 

VERTCS (2 ,NC) 


DESCRIPTION 


No. of lines in the rectangular 
field defined in FLDinF 

Name of the classified field 

No. of vertices in the classified 
field 

Vertices for the classified field. 
Vertices are ordered (sample, line)^, 
(sample, line) 2 /»** (sample, line) NC 

Data Record 


WRITE (MAPTAP)ILINE, (IR(I) ,I=1,PTS) , (VR(I) ,1-1, PTS) 

FORTRAN NAME 

AND DIMENSION DESCRIPTION 

ILINE Line number in reference to the 

multispectral scanner data tape 

IR(PTS) Subclass number to which each 

classified data point belongs 

VR(PTS) Likelihood that the point belongs 

to that subclass 

End-of-Field Record 

An end-of-fie 1 record has the same format as a data record 

with ILINE=0 . 

End-of-Run Record 


An end-of-run record has the same format as field header 
record with PTS=0 . 
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• APPENDIX E 
NHSTUN FILE FORMAT 


The inter^dce .tile written to the NHSTUN unit is output by the 
NDHIST processor and read by the 3CTRPL processor. 

All records are written with a nonformatted Fortran WRITE state- 
ment. The header record is always the first file on the tape*. 


The format of the tape is as follows: 


File 1 


HEADER RECORD 
END OF FILE (EOF) 


Data file 


1 < 


RECORD 

RECORD 

RECORD 

RECORD 

RECORD 

RECORD 

RECORD 

EOF 


1 

2 

3 

4 

5 

6 
7 


(optional) 


(optional) 


Data file N EOF 


The contents of each record are as follows: 


TOTMNS 

SIZE 

N0FET2 

(FETVC2 (I) , 1=1 ,NOFET2) 
NCLRCH 

(CLRVEC (I) ,1=1, NCLRCH) 


Header Record 

— Total number of means computed 
- NOFET2/4 

— Number of channels to histogram 
— Actual channels to histogram 
— Number of color code channels 
— Actual color code channels 



_> /u 


C 


Record 1 


N0FLD2 

NOSUB2 

T0TVT2 

NOVEC 


CLSVC2 
(SUBVC2 (I) , 
( (FIELDS (I, 
( (VERTEX ( I , 

(MEANS (I) ,1 

( (PLOT (I , J) 

(ID (I) ,1=1, 


( COUNTR ( I ) , 


- Number of fields histogrammed 

— Number of subclasses histogrammed 

— Number of vertices 

— Number of unique vectors 
histogrammed 

Record 2 

— Class name 

1=1 ,N0SUB2) — Subclass names 

J) ,1=1,4) ,J=1,N0FLD2) —Field information 
J) ,1=1 ,2) , J=1,T0TVT2) —Field vertices 

Record 3 (optional) 

=1 , TOTMNS ) — Mean stats for input fields 

Record 4 

, 1=1, SIZE) ,J=1, NOVEC) - Data vectors 

Record 5 

NOVEC) — Class/subclass/field the data 

vectors belong to 

Record 6 

1=1, NOVEC) — Number of occurrences of the 

data vectors 

Record 7 (optional) 

— Color codes extracted from MSS 
data tape 


(COLOR (I) ,1=1, NOVEC) 
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- APPENDIX F 

DESCRIPTION OF CLUSTER IMAGE DISPLAY 
WITH COLOR KEYS 

The cluster image data tape output by the ISOCLS processor con- 
tains the mean vector to which each corresponding pixel was 
assigned during clustering and a color key. The color key is 

r 

an n number of square images of 10 samples by 10 lines in dimen- 
sions. A color code square is composed of the mean vector .r:or a 
given cluster. The color codes are ordered according to cluster 
number or greenness. The greenness ordering (G) is a function 
of the four Landsat channels : 

G i / N = "°* 29y l,N " °* 56 ^2,N + °* 60y 3,N + °* 49M 4,N 
M 

G N = 2 G i f N 
i=l 



where 

M = number of passes for multiregistered Landsat data 

y^ = first channel of pass i 

]±2 - second channel of pass i 

y^ = third channel of pass i 

y^ = fourth channel of pass i 

i = number of pass 

N = cluster number 

The number of color codes per scan line is computed by 

T , _ number of samples per scan line 
K 11 


(E-2) 


The number of lines requited to display the color codes is 
computed by 

(number of clusters - 1) 

K 

The clustered field and color key are separated by a scan line of 
zeros. Each color code square is separated by a l-by-10 vertical 
line of zeros. 

r 

The data are output in LARSYS III or Universal format (see 
appendixes A and B, respectively) . 

The structure of the file is as follows. 





Inter- record gap. 
'End of file. 
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APPENDIX G 


SCTRUN FILE FORMAT 

The scatter plot image, written to the SCTRUN unit, contains two- 
axis coiorcoded spectral plot(s) an 1 is output in the Universal 
format (appendix B) by the SCTRPL processor. Each file of the 
multifile tape contains (1) a single scatter plot image, of 
which N - 1 channels are color assignments and the Tilth channel 
is the frequency channel, and (2) a color key, unless the color 
assignment is the radiance values of the output pixel. 


The color key is an N number of square images dimensioned 
10 samples by 10 lines. A color code square is composed of the 
colors assigned to a given pixel. Each color code is ordered 
according to its cluster association; i.e., the color code 
associated with cluster 1 is output first, followed by the color 
code associated with cluster 2, etc. The number of color codes 
per scan line is computed by 


T , _ number of samples per scan line 

11 


(F-l) 


The number of lines required to display the color codes is 
computed by 

L = ^ (number of clusters - 1) + ij x n 


(F-2) 


The scatter plot image and color key are separated by a scan 
line of zeros. Each color code square is separated by a l-by-10 
vertical line of zeros. 


The dimensions of the output tape are user controlled via the 
control cards SIZE and CHANNELS (see section 16, table 16-1, and 
section 15, table 15-1, respectively) , where 

(1) number of samples per scan lines = XSIZ 

(2) number of channels = dimensions of color pixel plus the 

frequency channel 

(3) number of scan lines = YSIZ + L 

The structure of the file is as follows: 

HEADER RECORD 
IRG 1 

YSIZ records — scatter plot image 

IRG 

or 

EOF (if color key is omitted) 

(YSIZ + 1) records — scan line of zeros 

IRG 

0 0 

0 0 

color 0 0 color 

code 2 • • code K 

• • 

• • 

0 0 

. ' v 

IRG 


(YXIZ + 2) records - 10 lines ^ 


0 
0 

color 0 
code 1 

• 
0 

I — ...y / 

10 samples 


(YXIZ + L) records — color code (last cluster) 

E-O-F 2 


^"Inter-record gap. 
2 End of file. 
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DOT DATA FILE FORMAT (DOTUNT) 


The tape, written on the DOTUNT unit, is output by the DOTDATA 
processor. The records are written with an unformatted FORTRAN 
write . 


A file is output for each TYPE of field (s) . The file consists 
of the following records : 


Repeat for 
each TYPE 


Rec . No . 
Rec . No . 


Rec . No . 


E-O-F 



3f 


field information 


data record 


TYPE 1 consists of labeling dots; TYPE 2 consists of bias 
correction dots. 

Rec. No. 1 

WRITE (DOTUNT) NOCAT, NOFEAT, NOFLD , TOTVRT , TOTDOT , NOSUN, 

(CATNAM(I) , 1=1, NOCAT), SIZE 

Parameter Dimension Definition 


NOCAT 

1 

Number- 
names . 

of 

category 

NOFEAT 

1 

Number 

of 

channels . 

NOFLD 

1 

Number 

of 

fields . 

TOTVRT 

1 

Number 

of 

vertices . 

TOTDOT 

1 

Number 

of 

dots . 

NOSUN 

1 

Number 

of 

sun angles 




Parameter 

Dimension 

Definition 

CATNAM 

NOCAT 

Array containing the 
category names. 

SIZE 

1 

Rec. No. 2 

4 + NOFEAT 

WRITE (DOTUNT) 

(FETVEC (I), 1=1, NOFEAT) 

( (FLDSAV (I, J) , 1=1,4), J=l, NOFLD), 
( (VERTEX (I, J) , 1=1,2), J=1 , TOTVRT ) , 
(ANGLE (I), 1=1, NOSUN) 

Parameter 

Dimension 

Definition 

FETVEC 

NOFEAT 

Array containing the 
channel numbers . 

FLDSAV 

(4, NOFLD) 

Array containing the 
field description. 

VERTEX 

(2, NOFLD) 

Array containing the 
field vertices. 

ANGLE 

NOSTJN 

Rec . No . 3 

Array containing the 
sun angles. 

WRITE (DOTUNT) 

((DOTS (I, J), 1=1, SIZE) 

, J=l, TOTDOT) 

Parameter 

Dimension 

Definition 

DOTS 

(TOTDOT, SIZE) 

Array containing the 
dot information, 

DOTS (1 , I) =sample number 
for dot i. 

DOTS (2 , I) =line number 
for dot i. 



Parameter 


Definition 


Dime .-; 3 ion 

DOTS (3,1) -type number 
for dot i. 

DOTS (4,1) -category 

number for 
dots i 
(optional). 

DOTS (5,1) 

• =dot 

• vector i* 

DOTS (4+NOFEAT ,1) 
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. APPENDIX I 
BASELINE RUN 

This run is published in volume I, section 20, and is not repeated 
here. 
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